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ABSTRACT 

Ultra-deep ACS and WFC3/IR HUDF-hHUDF09 data, along with the wide-area 
GOODS-HERS-t-CANDELS data over the CDF-S GOODS field, are used to measure UV col- 
ors, expressed as the UV-continuum slope /3, of star-forming galaxies over a wide range in luminosity 
(0.1L*^3 to 2L*^3) at high redshift (z - 7 to z ~ 4). P is measured using all ACS and WFC3/IR 
passbands uncontaminated by Lya and spectral breaks. Extensive tests show that our j3 mea- 
surements are only subject to minimal biases. Using a different selection procedure, Dunlop et al. 
recently found large biases in their /3 measurements. To reconcile these different results, we simulated 
both approaches and found that (5 measurements for faint sources are subject to large biases if the 
same passbands are used both to select the sources and to measure /3. High-redshift galaxies show 
a well-defined rest-frame UV color-magnitude (CM) relationship that becomes systematically bluer 
towards fainter UV luminosities. No evolution is seen in the slope of the UV CM relationship in the 
first 1.5 Gyr, though there is a small evolution in the zero-point to redder colors from z ~ 7 to z ~ 4. 
This suggests that galaxies are evolving along a well-defined sequence in the Lf/y-color (/3) plane (a 
"star-forming sequence"?). Dust appears to be the principal factor driving changes in the UV color 
P with luminosity. These new larger /? samples lead to improved dust extinction estimates at z 4-7 
and confirm that the extinction is essentially zero at low luminosities and high redshifts. Inclusion of 
the new dust extinction results leads to (i) excellent agreement between the SFR density at z ^ 4-8 
and that inferred from the stellar mass density, and (ii) to higher SSFRs at z > 4, suggesting the 
SSFR may evolve modestly (by factors of ~2) from z ^ 4-7 to z 2. 
Subject headings: galaxies: evolution — galaxies: high-redshift 



1. INTRODUCTION 

One of the biggest frontiers in extragalactic cosmol- 
ogy is to characterize the early build-up and evolution of 
galaxies. It is important for our understanding of early 
gas accretion and star formation in the universe, estimat- 
ing the rate of early metal injection into the IGM, and as- 
sessing the impact of galaxies on reionization. From the 
observations, we already have a good measure of how fast 
galaxies build up through Lyman-Break selections and 
luminosity function (LF) studies reaching all the way to 
z 8 (e.g., Bouwens et al. 2010b; McLure et al. 2010; 
Bunker et al. 2010; Yan et al. 2010). Galaxies show a 
remarkably uniform brightening in their LFs from z 8 
to z ^ 4 (e.g., Bouwens et al. 2011b) and plausibly from 
z ^ 10 (Bouwens et al. 2011a). There is even some evi- 
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dence for very rapid evolution at z > 8 (Bouwens et al. 
2011a; Oesch et al. 2012a). 

Despite general constraints on how the galaxy popu- 
lation builds up with cosmic time, much less is known 
about how individual galaxies grow. Qualitatively, we 
would expect galaxies to build up monotonically in mass, 
metallicity, and dust content as they form stars. Quan- 
tifying how this build up occurs and with what star for- 
mation history is very challenging however. The effects 
of dust, metal, and age on the colors are very similar and 
make it very difficult to disentangle one factor from the 
others. Nonetheless, there is enough information avail- 
able, i.e., the ?7F-to-optical colors (Stark et al. 2009; 
Gonzalez et al. 2010; Labbe et al. 2010b), UV colors 
(e.g., Bouwens et al. 2009; Hathi et al. 2008), and high- 
resolution spectra of high redshift galaxies (e.g.. Stark et 
al. 2010; Vanzella et al. 2009), that significant progress 
should be made in better characterizing galaxies through- 
out the build-up process. 

An important piece of the puzzle in deciphering how 
galaxies build up is provided by the rest-frame UV spec- 
trum and in particular the UV colors. The rest-frame 
UV color provides us with perhaps our best means for 
estimating the dust extinction and star formation rate 
(SFR) for faint z > 3 galaxies, given that other tech- 
niques for probing the SFR tend to only detect the 
most bolometrically luminous galaies (e.g., Bouwens et 
al. 2009; Smit et al. 2012). The UV colors also show a 
systematic dependence on the UV luminosities of star- 
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forming galaxies (o.g., Bouwcns et al. 2009; Bouwcns et 
al. 2010a) and arc much more amenable to direct mea- 
surement than rest-frame UV-optical colors where mid- 
IR (IRAC) photometry is necessary. 

A significant amount of effort has gone into establish- 
ing the J/F-continuum slope distribution at high redshift 
z > 2 and determining its dependence upon redshift 
and luminosity. The earliest analyses were at z ^ 2- 
3 using either ground-based observations (Steidel et al. 
1999; Adelberger & Steidel 2000) or the deep WFPC2 
observations over the HDF North (Meurer et al. 1999). 
Subsequent analyses pushed C/T^-continuum slope mea- 
surements to z ^ 4-6 using Subaru Suprime-Cam, HST 
ACS, or HST NICMOS observations (Ouchi et al. 2004a; 
Papovich et al. 2004; Stanway et al. 2005; Bouwens et 
al. 2006; Hathi et al. 2008). Bouwens et al. (2009) ex- 
tended these previous works by examining the tlV slopes 
as a function of luminosity over the entire redshift range 
z ~ 2-6 - establishing a coherent framework for under- 
standing the observational results at that time. 

The availability of both deep and wide WFC3/IR 
imaging has made it possible to substantially improve 
these early measurements of the J/F-continuum slope. 
These imaging observations allow for accurate measure- 
ments of the C/V"-continuum slopes for large numbers of 
z ^ 4-7 galaxies. Bouwens et al. (2010a) made use of 
the early WFC3/IR observations to examine the UV- 
continuum slope distribution out to z ^ 7 (see also Oesch 
et al. 2010a; Bouwens et al. 2010b; Bunker et al. 2010; 
Finkelstein et al. 2010; Robertson et al. 2010; Dunlop et 
al. 2012; Wilkins et al. 2011). Bouwens et al. (2010a) 
found that very low luminosity z ^ 7-8 galaxies in the 
ultra-deep HUDFOg WFC3/IR field had J7F-continuum 
slopes (3 as steep as —2.5 and plausibly consistent with 
—3 but with uncertainties of ~0.2-0.3. Finkelstein et al. 
(2010) also reported on the colors of ultra-faint z ^ 7-8 
galaxies in the HUDF finding similar steep /3 values, but 
with somewhat larger uncertainties. 

Since the early WFC3/IR campaign over the HUDF 
(Bouwens et al. 2010b; Oesch et al. 2010a) from the 
HUDF09 program (GO 11563: PI Illingworth), the 
amount of deep, wide-area WFC3/IR observations over 
well-known legacy fields has increased dramatically. At 
present, we have ultra-deep WFC3/IR observations over 
the two HUDF09/HUDF05 fields (Bouwens et al. 2011b; 
Oesch et al. 2007, 2010a), wide-area (^145 arcmin^) data 
over the CDF-South GOODS field as a result of the Early 
Release Science (Windhorst et al. 2011) and CANDELS 
(Grogin et al. 2011; Koekemoer et al. 2011) programs, 
and even deeper WFC3 /IR observations over the HUDF 
and HUDF05 fields (Bouwens et al. 2011b). These obser- 
vations greatly improve the luminosity baseline, redshift 
range, and precision with which we can define the UV- 
continuum slope distribution for z ~ 4-7 galaxies. 

In this paper, we take advantage of these new obser- 
vations to establish the distribution of [/^-continuum 
slopes 13 over a wide range in luminosity and redshift. 
These new observations allow us to determine with great 
precision how the C/y-continuum slope /3 distribution de- 
pends upon luminosity, in four distinct redshift intervals. 
This new information puts us in a position to look for a 
possible star-forming sequence of galaxies at high redshift 
and to characterize its evolution with cosmic time. The 



evolution of such a sequence provides useful information 
for better understanding early galaxy build-up. For ex- 
ample, the slope of a /3-luminosity relationship constrains 
how the dust and age of the galaxy population vary as a 
function of luminosity. Scatter in the ^-luminosity rela- 
tionship constrains the overall scatter in the stellar pop- 
ulations or dust extinction of individual galaxies. Fi- 
nally, evolution in the slope and offset of the relation 
with cosmic time gives us clues as to possible changes 
in how galaxies evolve, either in age or dust extinction. 
Independent analyses of the J/V-continuum slope (3 in 
z ^ 5-7 galaxies are provided by Dunlop et al. (2012) 
and Wilkins et al. (2011), but both are based on a much 
smaller, shallowcir set of observations. A somewhat com- 
plementary analysis to the one described here is given by 
Gonzalez et al. (2012) who quantify the changes in UV- 
optical colors as a function of luminosity and redshift. 

We provide a brief overview of the paper here. We be- 
gin with a brief summary of the observational data (§2). 
In §3, we describe the manner in which we construct our 
high-redshift samples from the observational data, mea- 
sure the C/V-continuum slope /3 distribution, correct for 
measurement and selection biases, and present evidence 
for well-defined color-magnitude relation for galaxies in 
the rest-fame UV. In §4, we compare the present UV- 
continuum slope /3 determinations with previous dcrtcsr- 
minations. In §5, we explore the implications of such a 
color-magnitude relation for galaxy growth - using the 
J7y-continuum slopes (3 to infer a luminosity-dependent 
dust correction for galaxies. In §6, we use these extinc- 
tion estimates to rederive the SFR density at z > 4 and 
then compare these results with what one infers from 
the stellar mass density. Finally, in §7, we conclude and 
provide a summary of our primary results. The appen- 
dices include a detailed description of many quantitative 
results and simulations essential for accurate measure- 
ments of the [/V"-continuum slopes. 

Throughout this work, we find it convenient to quote 
results in terms of the luminosity L^^^ Steidel et al. 
(1999) derived at z ~ 3, i.e., Miroo,AB = -21.07, for 
consistency with previoiis work - though we note that 
the Steidel et al. (1999) LF results are now updated 
(Muoo^AB = -20.97 ± 0.14: Reddy & Steidel 2009) 
but still consistent with the previous determination. We 
present our dust extinction estimates as the ratio of the 
bolometric luminosity {IR + UV luminosity: Lm + 
Luv) to the UV luminosity (£c/y), i-C LiujLjjv + 1- 
We refer to the HST F435W, F606W, F775W, F814W, 
F850LP, F098M, F105W, F125W, and F160W bands as 

■B435) Vqqq, 1775, -^814, -2850) ^98) ^105) </l25, and i?i60, 

respectively. Where necessary, we assume fio = 0.3, 
flA = 0.7, Hq = 70km/s/Mpc. We quote all star forma- 
tion rates and stellar masses assuming a Salpctcr (1955) 
IMF. All magnitudes are in the AB system (Oke & Gunn 
1983). 

2. OBSERVATIONAL DATA 

Here we utilize two primary data sets to examine 
the J7F-continuum slope (3 distribution of galaxies from 
z - 7 to z - 4: (1) the ultra-deep ACS+WFC3/IR ob- 
servations over the three HUDF09 fields (Bouwens et al. 
2010b; Bouwens et al. 2011b; Oesch et al. 2010a) and (2) 
the wide-area ACS-t-WFC3/IR observations taken over 
the CDF-South GOODS field as a result of Early Release 
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TABLE 1 

A SUMMARY OF THE OBSERVATIONAL DATA USED TO 
ESTABLISH THE DISTRIBUTION OF l/F-CONTINUUM SLOPES /3 

FROM z ~ 7 TO z ~ 4 (see Figure [T] for the layout of 

THESE DATA WITHIN THE CDF-SOUTH). 



Detection'' PSF FWHM Areal Coverage 
Passband Limits (Scr) (arcsec) (arcmin-^) 





HUDF09 {WFC3/IR HUDF) 




^435 


29.7 


0.09 


4. i 


^606 


30.1 


0.09 


4.7 


^775 


29.9 


0.09 


4.7 




29.4 


0.10 


4.7 


no5 


29.6 


0.15 


4.7 




29.9 


0.16 


4.7 


Hieo 


29.9 


0.17 


4.7 




HUDF09-1 (WFC3/IR P12) 




Veoe 


29.0 


0.09 


4.7 


i775 


29.0 


0.09 


4.7 


2850 


29.0 


0.10 


4.7 


Yl05 


29.0 


0.15 


4.7 


Jl25 


29.3 


0.16 


4.7 


Hl60 


29.1 


0.17 


4.7 




HUDF09-2 (WFC3/IR P34) 






28.8 


0.09 


3.3 




29.9 


0.09 


4.7 


^■775 


29.3 


0.09 


^. i 


T b 
J814 


29.0 


0.09 


3.3 


1-, 

Z850 


29.2 


0.10 


4.7 


Yios 


29.2 


0.15 


4.7 


J125 


29.5 


0.16 


4.7 


H160 


29.3 


0.17 


4.7 




ERS (CDF-S GOODS) 




B435 


28.2 


0.09 


39 


Veoe 


28.5 


0.09 


39 


1775 


28.0 


0.09 


39 


-^814 


28.0<= 


0.09 


39 


2850 


28.0 


0.10 


39 


^098 


27.9 


0.15 


39 


^125 


28.4 


0.16 


39 




28.1 


0.17 


39 




CDF- 


-S CANDELS Deep 




S435 


28.2 


0.09 


66 


^^606 


28.5 


0.09 


66 


«775 


28.0 


0.09 


66 


-^814 


28.8 


0.09 


66 


2850 


28.0 


0.10 


66 


yio5 


28.5 


0.16 


66 


J125 


28.8 


0.16 


66 


^^160 


28.5 


0.17 


66 




CDF-S CANDELS Wide 




S435 


28.2 


0.09 


40 


Veoe 


28.5 


0.09 


40 


«775 


28.0 


0.09 


40 


-^814 


28. 1'' 


0.09 


40 


2850 


28.0 


0.10 


40 


Yi05 


28.0 


0.16 


40 


J125 


28.0 


0.16 


40 


^^160 


27.7 


0.17 


40 



5(T detection limits for our z ~ 4-7 ACS+WFC3/IR selec- 
tions were measured in a 0.35"-diameter aperture. 

Our reductions of the ACS data over the HUDF09-2 field 
include both those observations taken as part of the HUDF05 
(82 orbits) and HUDF09 (111 orbits) programs (see Figure 1 
from Bouwens et al. 2011b). The latter observations add 
^0. 15-0. 4 mag to the total optical depths. 

The depth of the F814W observations vary considerably 
over the ERS and CDF-South CANDELS Wide field, from 
very deep coverage (~28.8 mag) in some regions to essen- 
tially no coverage over a small fraction of the area. Typical 
depths are ~28.0 AB mag (Scr). 




Fig. 1.— Deep WFC3/IR data over the extended CDF-South 
GOODS field that can be used to be establish the f/V-continuum 
slope /3 distribution for star-forming galaxies at z > 4. Ultra- 
deep WFC3/IR observations are available over the three HUDF09 
fields HUDF09, HUDF09-1, and HUDF09-2 (red-shaded regions) 
while moderately deep WFC3/IR observations are available over 
the ~145 arcmin^ ERS (yellow regions) and CANDELS {orange 
regions) areas. The blue and dark blue regions show the position 
of the deep GOODS ACS and ultra-deep HUDF-(-HUDF05 ACS 
observations, respectively. A convenient summary of the observa- 
tional properties of each of these fields is provided in Table [T] 

Science and CANDELS programs (Windhorst et al. 2011; 
Giavalisco et al. 2004; Grogin et al. 2011; Koekemoer et 
al. 2011). A brief summary of the properties of these ob- 
servations is provided in Table [T] Figure [T] shows the lay- 
out of these observations over the CDF-South GOODS 
area. 

2.1. HUDF09 Observations 

The first set of observations we utilize is from the 
HUDF09 program (G011563: PI lUingworth) and in- 
volves ultra-deep WFC3/IR observations over three fields 
that already have ultra-deep ACS observations. These 
fields include the HUDF (Beckwith et al. 2006) and two 
HUDF05 fianking fields (Oesch et al. 2007). These ob- 
servations are primarily of use in establishing the distri- 
bution of C/y-continuum slopes /3 for very faint z ^ 4-7 
galaxies. 

At present, the full two years of WFC3/IR observations 
(all 192 orbits from the G011563 program) have been 
obtained over the three HUDF09 fields. The WFC3/IR 
observations over the HUDF include some 111 orbits 
of observations - while the HUDF09-1 and HUDF09- 
2 fields include 33 orbits and 48 orbits of observations, 
respectively. Our reductions of the WFC3/IR observa- 
tions over these fields are already described in Bouwens 
et al. (2011b) and are conducted using standard proce- 
dures. Particularly important to this process was ensur- 
ing that the geometric distortion and velocity abberation 
were treated properly so that the registration between 
bands and with the ACS observations was very accurate 
(<0.01"). Accurate registration is absolutely essential 



4 



not only for maximizing the accuracy of our cofor mea- 
surements, but also for minimizing the effect that misreg- 
istration might have on scatter in these same color mea- 
surements. The depths of our WFC3/IR observations 
reach to >29 AB mag at 5cr and are presented in detail 
in Table 2l The FWHM of the PSF in the WFC3/IR 
observations is ^0.16". 

Our reductions of the ACS observations over the 
HUDF09-2 area include all available data ~ including 
the 82 orbits that were taken during the execution of the 
original HUDF05 program and an additional 111 orbits 
that were taken in parallel with the HUDF09 WFC3/IR 
observations over the HUDF. The total number of orbits 
per filter over the HUDF09-2 are 10 orbits F435W, 32 
orbits F606W, 46 orbits F775W, 16 orbits F814W, and 
89 orbits F850LP. The total integration time is >50% of 
that available for the HUDF, allowing us to reach within 
<0.4 mag of the HUDF. This provides us with sufficiently 
high S/N levels required to keep contamination in z > 6 
selections to a minimum. A more detailed description of 
this data set is provided by Bouwens et al. (2011b). 

2.2. ERS/CANDELS Observations 

Wide-area WFC3/IR observations allow us to estab- 
lish the [/^/-continuum slope /3 distribution for the rarer, 
higher luminosity sources. These observations are avail- 
able over the CDF-South GOODS field from the Early 
Release Science (Windhorst et al. 2011) and CANDELS 
(Grogin et al. 2011; Koekemoer et al. 2011) programs. 

The WFC3/IR observations from the ERS program 
are distributed over the northern part of the CDF South 
GOODS field (Windhorst et al. 2011: Figure [J). These 
observations consist of 60 orbits of F098M, F125W, and 
F160W observations distributed over 10 distinct 2.1' x 
2.3' pointings, with 2 orbits F098M, 2 orbits F125W, 
and 2 orbits F160W per pointing. The WFC3/IR obser- 
vations extend over ~43 arcmin^ in total, although only 
'--^39 arcmin^ of those observations overlap with the ACS 
GOODS data and can be used here. 

WFC3/IR observations from the CANDELS program 
are distributed over the southern two-thirds of the CDF- 
South GOODS field (106 arcmin^), with the deepest sec- 
tion planned for the central ^66 arcmin^ portion. 3 or- 
bits of F105W, 4 orbits of F125W, and 4 orbits of F160W 
observations are available over the central sections of the 
CDF-South (representing all 10 SNe search epochs to 
February 18, 2012) while only 1 orbit of F105W, F125W 
and F160W observations are available over the southern 
portion. The layout of the CANDELS and ERS obser- 
vations with the CDF-South GOODS field is shown in 
Figure [TJ 

Our reductions of the WFC3 /IR observations over the 
CDF-South are performed in exactly the same manner 
as for the HUDF09 observations. See Bouwens et al. 
(2011b) for a detailed description. As in our HUDF09 
reductions, a 0.06" pixel size was used. For the ACS 
observations over the CDF-South, we used the Bouwens 
et al. (2007) reductions for the F435W, F606W, F775W, 
and F850LP bands. These reductions are comparable to 
the GOODS v2.0 reduction (Giavalisco et al. 2004) but 
take advantage of the substantial SNe follow-up observa- 
tions over the CDF-South (Riess et al. 2007) which add 
~0. 1-0.3 mag of depth to the zgso band. 



TABLE 2 

Lyman-break samples used to measure the distribution of 

(/y-CONTINUUM slopes /3 AS A FUNCTION OF REDSHIFT AND UV 
LUMINOSITY. 



Sample'' 


Field 


Luminosity 
Range'' 




# ot 
Sources 


Z r- 


- 4 


HUDF09 
ERS/CANDELS 


-23 < Muv,AB < 
-23 < Muv.AB < 


-16 
-18 


308 
1524 


Z r- 


- 5 


HUDF09 
ERS/CANDELS 


-23 < Muv,AB < 
— 23 < Muv,AB < - 


-17 
-18.5 


137 
277 


Z ^ 


- 6 


HUDF09 
ERS/CANDELS 


-23 < Muv,AB < 
-23 < Muv.AB < 


-17 
-19 


70 
101 


Z r- 


- 7 


HUDF09 
ERS/CANDELS 


-22 < Muv,AB < 
-22 < Muv,AB < 


-17 
-19 


57c,d 
44 



^ The mean rcdshift we estimate for these samples is 3.8, 4.9, 5.9, 
and 7.0, respectively (Figure|3ll. 

The faint magnitude limit is set by the 5(t depth of the WFC3/IR 
near-IR observations over our search fields. See §3.2. 

These samples include sources from the Bouwens et al. (2011b) 
HUDF09 2 ^ 7 samples and the lowest redshift galaxies in the 
Bouwens et al. (2011b) HUDF09 ^ ~ 8 selection. 

To keep contamination in our faint z ~ 7 selection to a minimum, 
we only include faint Ji25,Afl > 28 sources from the HUDF09 fields 
with the deepest optical data. We therefore only consider sources 
from the HUDF (HUDF09) and the --3 arcmin'^ (~70%) area in 
the HUDF09-2 field with deep optical coverage from the HUDF09 
program (see Figure 1 from Bouwens et al. 2011b). 

We also reduced the new ACS /814-band observa- 
tions over the CDF-South GOODS fields from the CAN- 
DELS+ERS programs to take advantage of the su- 
perb depth available with these data at ~8000A (typ- 
ically 13 orbits over the CANDELS-DEEP region, or 
0.8 mags deeper than the GOODS 1775 band exposures). 
The F814W observations are valuable for controlling for 
contamination in our z 7 selections and for min- 
imizing the uncertainties in our /3 determinations at 
z ~ 4. We included all F814W observations from the 
ERS-hCANDELS programs over the CDF-South. Af- 
ter using public codes (e.g., Anderson & Bedin 2010) 
to correct the raw frames for charge transfer efficiency 
defects and row-by-row banding artifacts, we performed 
the alignment, cosmic-ray rejection, and drizzling with 
the ACS GTO apsis pipehne (Blakeslee et al. 2003). 

3. RESULTS 

In this section, we describe the procedure we use to 
establish the distribution of C/y-continuum slopes /3 ver- 
sus UV luminosity for z ~ 4-7 galaxies. We begin 
with a description of the technique we use for generat- 
ing the source catalogs needed for sample selection and 
J/y-continuum slope measurements (§3.1). We then de- 
scribe our procedure for selecting our high-redshift sam- 
ples (§3.2) and for deriving the t/F-continuum slope /3 
for individual sources (§3.3). In §3.4, we detail the meth- 
ods we use to correct the observed distribution of UV- 
continuum slopes /? for selection and measurement biases. 
In §3.5 we combine our J/y-continuum slope /3 determi- 
nations in the ultra-deep + wide-area data sets to estab- 
lish J/y-continuum slope /3 over a wide-range in redshift 
and luminosity. We then conclude with a discussion of 
the correlation we find between the J/y-slope /3 and the 
UV luminosity (§3.6). 
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Fig. 2. — The two-color Lyman-Break selection criteria used to select our 2 ~ 4, 2 ~ 5, 2 ~ 6, and 2 ~ 7 galaxy samples. These samples 
are used to derive the f/V-continuum slope /3 distribution as a function of redshift and luminosity (see §3.2). The blue lines show the 
expected colors versus redshift for star-forming galaxies with a range of (7y-continuum slopes /3, while the red lines show the colors expected 
for low-redshift interlopers. The green hatched region in the 2 ^ 7 panel shows the position of low-mass stars in 2350 — lios 7^105 ~ 1^125 
space. The black dots show the colors of sources found in our HUDF catalog and provide some indication of the approximation distribution 
of sources in color-color space. The two-color selection windows we use to identify high-redshift galaxies are indicated in gray. These 
selection windows allow for the identification of galaxies over a wide range of f/V-continuum slopes /3 ~ —3 to 0.5. The red arrows show 
the Calzetti et al. (2000) reddening vectors. In addition to the two colour selection criteria shown here, we also utilize a few other criteria 
in establishing our final samples. For example, we enforce a very stringent optical non-detection criteria, especially for our 2 ~ 7 samples 
where we require the optical x^^^ be below a certain threshold (§3.2). 
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TABLE 3 

Wavebands used to derive the UV continuum 
slope for individual galaxies in our 2 ~ 4, 
2 ~ 5, 2 ~ 6, and 2 ~ 7 samples.^ 



Fig. 3. — The predicted redshift distribution for our 2 ~ 4, 2 ~ 5, 
2 ~ 6, and 2 ~ 7 Lyman-break samples from the HUDF. The mean 
redshifts for these samples are 3.8, 5.0, 5.9, 7.0, respectively. The 
three lower redshift selections are essentially the same as those 
presented in Bouwens et al. (2007: though the present selection 
window for 2 ~ 6 galaxies cuts off at 2 > 6.2) while the 2 ~ 7 
selection is similar to that presented in Bouwens et al. (2011b) but 
extends to slightly higher redshifts (§3.2). This maximizes the size 
of our 2 ~ 7 samples while extending our selection to the highest 
redshift possible without suffering significant contamination from 
hya emission or IGM absorption. The redshift distribution for our 
Lyman-break samples from our other search fields are similar to 
those shown here. 

3.1. Source Catalogs 

The procedure for source detection and photometry are 
discussed in many of our previous papers (e.g., Bouwens 
et al. 2007); we just summarize the key steps here. We 
generate catalogs for our samples using the SExtractor 
(Bertin & Arnouts 1996) software run in double image 
mode, with the detection image taken to be the square 
root of image (Szalay et al. 1999: similar to a coad- 
ded image of the WFC3/IR observations) and the mea- 
surement image to be a PSF-matched image from our 
ACS-KWFC3/IR image sets (see below). 

We select each of our high-redshift samples (i.e., z ^ 4, 
z ~ 5, z 6, and z ^ 7) from separately-generated 
source catalogs. This allows us to take advantage of 
the fact that z ^ 4-6 samples can be selected entirely 
(or almost entirely) based on the ACS data (involving 
generally smaller apertures and therefore reducing the 
uncertainties in our color measurements) while z ^ 7 se- 
lections require PSF-matching to the WFC3/IR data|3 
For our z ^ 4-6 catalogs, the first step is to match PSFs 
by smoothing the shorter wavelength images to the res- 
olution of the ACS Z85o-band image. The square root 
of images we use for the SExtractor detection im- 
ages are constructed from the Vgoe* 775-^8501 *775-2850j and 
zgso images for our z 4, z ~ 5, and z ^ 6 selections, 
respectively. For our z ~ 7 catalogs, all the observa- 
tions are PSF-matched to the WFC3/IR Hiqo band. The 
square root of images are constructed from the avail- 
able WFC3/IR imaging observations, e.g., ^105^125-^160 
for our HUDF09 fields, loos •^125-^160 images for the ERS 
observations, and Ji25-ffi60 for the CANDELS observa- 
tions. 

Colors are measured within small scalable apertures 

* For our 2 ~ 6 selections, some use of the available WFC3/IR 
data is made. In addition to satisfying a (4775 — zs5o)ab > 1-3 
criterion, sources in our 2 ~ 6 selection must also satisfy a (2350 — 
■^125) < 0.9 color criterion (§3.2). 





Filters used 


Mean rest-frame 
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Wavelength'' 







HUDF09 Observations 




2 


- 4 


i775 2850 5^105 "^125 


204lA 


2 r- 


- 5 


2850^105 '^125-^160 


I997A 


2 r- 


- 6 


^105 '^125^^^160 


1846A 


2 r- 


- 7 


1^125 ^^^160 


1731 A 






ERS Observations 




2 r- 


- 4 


«775-f8142850y098 A25 


204lA 


2 r- 


- 5 


2850^098 ■.^125-f^l60 


I997A 


2 r- 


- 6 


i098./l25J^160 


1784 A 


2 r- 


- 7 


Jl25^fl60 


1731 A 






CDF-S CANDELS Observations 


2 r- 


- 4 


«775-f8142850yi05..'l25 


204lA 


2 r- 


- 5 


2^850^105 •.^125^^160 


I997A 


2 r- 


- 6 


^105 •^125^^160 


1846A 


2 r- 


- 7 


Jl25^^160 


1731 A 



^ These filters probe the f/V-continuum light of 
sources without contamination from Lyo emission 
or the position of the the Lyman break (being suf- 
ficiently redward of the 1216A). See §3.3. 
^ Geometric mean 

using a Kron (1980) factor of 1.2. These small-aperture 
color measurements are then corrected to total magni- 
tudes in two steps. First, a correction is made for the 
light in a larger scalable aperture, with Kron factor equal 
to 2.5. This correction is made from the square root of 
image to optimize the S/N. Secondly, a correction 
is made for light outside this large scalable aperture - 
using the encircled energies measured for point sources 
(typically a ~0.1-0.15 mag correction). 

Photometry was done using the latest WFC3/IR zero- 
point calibrations (March 6, 2012) which differ by ^0.01- 
0.02 mag from the earlier zeropoints. In addition, a 
correction to the photometry (i.e., E{B — V) — 0.009) 
was performed to account for foreground dust extinction 
from our own galaxy. This correction was based on the 
Schlegel et al. (1998) dust map. 

3.2. Sample Selection 

To select star-forming galaxies at high-redshift, we 
will use the well-established Lyman-Break Galaxy (LBG) 
technique. This technique takes advantage of the 
unique spectral characteristics of high-redshift star- 
forming galaxies, which show a very blue spectrum over- 
all but a sharp cut-off blueward of Lya. It has been 
shown to be very robust through extensive spectroscopic 
follow-up (Steidel et al. 1996; Steidel et al. 2003; Bunker 
et al. 2003; Dow-Hygelund et al. 2007; Popesso et al. 
2009; Vanzella et al. 2009; Stark et al. 2010). 

We will base our high-redshift samples on selection cri- 
teria from previous work on z > 4 galaxies. For our z 4 
i?435 and z ~ 5 Veoe dropout samples, we will apply al- 
most the same criteria as Bouwens et al. (2007), namely, 

(S435 - Vdoe > 1.1) A (B435 - V"606 > (^606 - Z850) + 1-1) 

A(F606 - 2850 < 1-6) 
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Fig. 4. — (upper) Illustration of how we estimate the UV- 
continuum slope for a z ~ 4 galaxy candidate (see §3.3). The 
blue and red points show the observed magnitudes for the galaxy 
in the ACS and WFC3/IR observations, respectively. The -ffieo- 
band is not used for determining the C/V-continuum slope /3 for 
our 2: ~ 4 samples, since the i?i6o-band magnitudes for galaxies 
at the low-redshift end [z ~ 3.0-3.5] of our z ~ 4 samples will 
include a contribution from light redward of the Balmer break. 
The black line shows the C/y-continuum slope we estimate for the 
source fitting to the 1775, zgso, V105, and J125 band photometry. 
The dotted black line is a plausible SED from a stellar population 
model which fits the observed photometry and is shown here to 
show where the major spectral features occur (but we emphasize 
that these SED fits are not used to establish the (7V-continuum 
slopes). Using the full wavelength baseline provided by both the 
ACS and WFC3/IR observations (Table [Sj, we are able to estab- 
lish the l/V-continuum slopes (5 much more accurately than using 
the ACS observations alone, (lower) An illustration of how we 
estimate the f/V-continuum slope /3 for a 2 ~ 6 galaxy. The UV- 
continuum slopes /3 for z ~ 6 galaxies are derived using the full flux 
information in the Vo98 7^105 1 •^125, and Hiqo bands (Table [St. 



for our i?-dropout sample and 

[(Veoe - 1775 > 0.9(^775 - ZS50) + 1.5) V (^606 - «775 > 2))] A 

(Veoe - «775 > 1-2) A (1775 - ^850 < 0.8) 

for our Vgoe-dropout selection (but note we use a 1775 — 
•2^850 < 0.8 color selection instead of the 1775 — zgso < 
1.2 selection used by Bouwens et al. 2007 to minimize 
contamination). For our z ~ 6 Z775-dropout selection, 
we expanded the criteria used in Bouwens et al. (2007) 
to take advantage of the deep near-IR observations from 
WFC3/IR to set limits on the color redward of the break. 
Our z ^ 6 i775-dropout criterion is 

(«775 - ^^850 > 1.3) A (Z850 - ^125 < 0.9) 

Finally, our z ~ 7 zgso-dropout criterion is 

(^850 - Yio5 > 0.7) A (Yi05 - Jl25 < 0.8) A 

(^850 - 5^105 > 1.4(^105 - ^125) + 0.42), 

or 

(^850 - J125 > 0.9) A 
(2850 - ^125 > 0.8 + 1.1( J125 - Hieo)) A 

(Z850 - ^125 > 0.4 -I- l.l(yo98 " J125)) , 

depending upon whether our search field is from the 
HUDF09/CANDELS datasets or the ERS dataset, re- 
spectively. The above selection criteria closely match 
those used in our previous study of the [/T^-continuum 
slope /? at z 7 (Bouwens et al. 2010a). These crite- 
ria are slightly more inclusive than those considered by 
Bouwens et al. (2011b), but this is to allow us to maxi- 
mize the size of our samples and to extend our selection 
to the highest redshift possible without suffering signif- 
icant contamination from Lya emission or IGM absorp- 
tion. In cases of a non-detection in the dropout band, 
we set the flux in the dropout band to be equal to the 
la upper limit. To take advantage of the deep /gi4-band 
observations over the CDF-South to keep contamination 
in our z ^ 7 samples to a minimum, we required that all 
z ~ 7 zg5o-dropouts in our selection to have Igu — J125 
colors greater than 2.0 or to be undetected in the /gi4 
band (at < 1.5a). No attempt is made to select or mea- 
sure J/V-continuum slopes for star-forming galaxies at 
z ^ 8 due to the fact that the J125 — i?i60 color (the 
only available color providing high S/N information on 
the rest-frame UV SED at z ~ 8) is affected by Lyman- 
series absorption and Lya emission at z > 8.1 (but see 
Taniguchi et al. 2011). 

Figure [2] provides a convenient illustration of the ap- 
proximate range in [/T^-continuum slopes /? and redshifts 
selected by the above two-colour criteria. 

We utilize several additional selection criteria in defin- 
ing our final samples. Sources are required to have 
SExtractor stellarity parameters less than 0.8 (i.e., they 
show evidence of being extended) to ensure our samples 
are free of contamination from AGN or low-mass stars. 
Given that >97% of bright sources in Lyman-Break se- 
lections show clear evidence of having extended profiles 
(i.e., are not pointlike), this criterion has almost no effect 
on the overall composition of our samples. 

We also require that sources be undetected in all bands 
blueward of the dropout bands. Sources are rejected if 
they are detected at >2a in a single band, >1.5a in 2 
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bands, or have a Xopt > 3 in the optical bands. We take 
xlpt = SiSGN(/i)(/i/CTj)2 where fi is the flux in band i 
in our smaller scalable apertures, cji is the uncertainty in 
this flux, and SGN(/i) is equal to 1 if /i > and —1 if 
/j < (Bouwens et al. 2011b). As Bouwens et al. (2011b) 
illustrate, a Xopt criterion can be particularly useful for 
minimizing contamination in high redshift samples (see 
also Oesch et al. 2012a, b). 

Table [2] summarizes the properties of the z ~ 4, 
z ^ 5, z ~ 6, and z ^ 7 samples we derive from the 
HUDF09+ERS+CANDELS observations. In total, 308, 
137, 70, and 57 z ^ 4, z ^ 5, z 6, and z ^ 7 galaxies 
are found in the HUDF09 fields and 1524, 277, 101, and 
44 z 4, z 5, z 6, and z 7 galaxies are found in 
the ERS+CANDELS fields. The approximate redshift 
distributions for our z 4, z ~ 5, z ~ 6, and z ~ 7 
selections are shown more explicitly in Figure [3l These 
redshift distributions are as calculated in Bouwens et al. 
(2007) and Bouwens et al. (2011b). 

3.3. Estimating the UV -Continuum Slope 

Establishing the C/y-continuum slope /3 for a source 
(where /? is defined such that fxocX^) requires that we 
have at least two flux measurements of a source in the 
i7F-continuum not significantly affected by Lya or IGM 
absorption (unless both the redshift and Lya flux are al- 
ready known). In practice, this means that we should 
not make use of the flux information in the band im- 
mediately redward of the break - since it is frequently 
contaminated by Lya emission or absorption from the 
IGM. Assuming we have wavelength coverage in the 
-B435^606«775285oyi05>^i25-ffi60 bands, we Can obtain good 
estimates of the [/V^-continuum slope /3 for high-redshift 
sources from z 4 to at least z ~ 7.5. 

There are a few different approaches we could adopt 
in using the available flux measurements in the UV- 
continuum to estimate these slopes. One approach is to 
derive the ?7F-continuum slopes from a fit to all available 
flux measurements in the rest-frame UV. An alternate 
approach is to derive a slope from the flux measurements 
at both ends of a fixed rest-frame wavelength range (e.g., 
as utilized by Bouwens et al. 2009). 

Each approach has its advantages. The first approach 
uses the full fiux information available on each source 
and also takes advantage of a much more extended wave- 
length baseline. This results in much smaller uncertain- 
ties in our /3 determinations, particularly at z ^ 4 and 
z ~ 5 where fiuxes in four separate bandpasses are avail- 
able. The simulations we perform in Appendix B.3 sug- 
gest factors of ~1.5 improvement in the uncertainties on 
^ at z ~ 4 and z ~ 5. On the other hand, the second ap- 
proach has the advantage that all C/V^-continuum slope 
/3 determinations are made using a similar wavelength 
baseline at all redshifts. As a result, even if the SED of 
galaxies in the rest-frame UV is not a perfect power law 
(e.g., from the well-known bump in dust extinction law at 
2175A: Stecher 1965), we would expect /? determinations 
made at one redshift to show no systematic offset rela- 
tive to those made at another, allowing for more robust 
measurements of evolution across cosmic time. 

After some testing, we adopted the approach that uti- 
lizes all the available fiux information to determine the 
J/y-continuum slopes /3. Figure |4] provides an example 



1.5 
-2 



I I I I I I I I I I I I I I I I I I I I I I 
~ 6. = -2 z~7 ~ 

r ^' input ^ ^ ' . 



-2.5 F 

-3 
-3.5 

-1.5 
-2 



-2.5 - 



-3 
■3.5 



Totol Bios 



Selection Volume Bios 

Photometric Error Coupling Bios - 



I I I I I I I I I I I I I I I I I I I I I r 

r /S. = -2.5 z~7 - 

input ^ -1 




Total Bios 



- — — Selection Volume Bios 

Photometric Error Coupling Bios 

... I .... I .... I .... I . 



25 26 



27 28 
n,R (AB) 



29 



Fig. 5. — The (71^-continuum slopes /9 expected to be recovered 
in HUDF09-depth z ~ 7 selections assuming specific input values 
for the f/y-continuum slope f} (§3.4; Appendix B). The results are 
shown as a function of IR magnitude and based on simulations 
with > 10^ galaxies. The expected biases in the recovered UV- 
continuum slopes /3 that r esul t from /3-depcndent selection effects 
(dotted blue lines: Figure [21] and Appendix B.1.1) and from any 
coupling of the photometric errors in the measurement process to 
source selection {dashed blue lines: Appendix B.1.2) are presented 
separately. The solid blue line shows the /3's we recover including 
both effects. The results of similar simulati ons for z ^ A, z 
5, and 2 ~ 6 selections are shown in Figure [22l in Appendix B. 
Overall the (7y-continuum slopes /3 we recover are very similar to 
the input slopes (A/3 < 0.15). One possible exception would the 
faintest luminosity bin in our 2 ~ 7 sample, but we emphasize that 
even in this regime our corrected /3 measurements appear to be 
accurate, given their excellent agreement with other largely bias- 
free estimates we make (see §4.8). This strongly suggests that 
biases in our derived {/V-continuum slope /3 distributions are very 
small overall. 

of such a determination for a z ~ 4 galaxy in our HUDF 
sample and an example of such a determination for a 
z 6 galaxy. The passbands we will consider in de- 
riving the slopes include the i775Z85oyio5'^i25 bands for 
z ~ 4 galaxies, the Z85oyio5t/i25^^i60 bands for z 5 
galaxies, the Yio5'/i25^fi60 bands for z ^ 6 galaxies, 
and the Ji25ffi60 bands for z 7 galaxies. We include 
the /gi4 band in these fits for z 4 galaxies over the 
ERS/CANDELS fields. We replace the Y105 band with 
the Fogs band in these fits for galaxies over the ERS field. 
In selecting these passbands, we explicitly excluded pass- 
bands which could be contaminated by Lya emission, the 
Lyman-continuum break, or fiux redward of the Balmer 
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break. These choices are motivated by our expected red- 
shift distributions for these samples (Figure [3]) . Table |3] 
includes a list of all the bands we use to perform these 
fits. The mean rest- frame wavelengths for our derived 
C/V^-continuum slopes /3 at z ~ 4, z ^ 5, z ^ 6, and 
z ~ 7 are 204lA, 1997 A, 1846A, and 173lA, respec- 
tively. 

In determining the t/l^-continuum slopes /3 from 
the flux information just discussed, we find the UV- 
continuum slope /3 that minimizes the value of x^: 



where fi and ai are the observed fluxes and uncertain- 
ties, respectively, /3 is the best-fit J/y-continuum slope, 
and /o is the best-fit normalization factor. The fluxes 
fi ± (Ti in the above fits are from photometry of the full 
ACS+WFC3/IR data set PSF-matched to the WFC3/IR 
i?i6o-band, using typical Kron-style apertures. 

This photometry is therefore distinct from that used to 
select our z ~ 4, z ~ 5, and z ^ 6 samples - since sources 
in our z ^ 4, z ^ 5, and z 6 Lyman-break samples were 
selected based on catalogs where the PSF matching was 
done to the ACS zgso band. This approach offers two 
clear advantages. (1) By selecting sources based on cat- 
alogs where the PSF-matching is done to the zgsQ-band 
and therefore our photometric apertures are smaller, we 
utilise much higher S /N photometry for the selection pro- 
cess, improving the robustness of our z ^ 4, 5, and 6 se- 
lections. (2) By using different photometry (and smaller 
apertures) to select sources than we use to estimate /? 
(typically involving larger apertures) , we ensure that our 
/3 measurements are not subject to exactly the same noise 
as affects source selection (because the photometric aper- 
tures are different and will have somewhat different noise 
characteristics) . This makes our z ~ 4-6 /3 measurements 
less susceptible (by ~50%) to the biases that arise from 
a coupling of errors in our /3 measurements with similar 
errors in our photometric selection ("photometric error 
coupling bias": Appendix B.1.2). For our z ~ 7 Lyman- 
break sample, however, the photometry used to estimate 
/3 is the same as what we use to select the sources. See 
§3.1 for a description of the photometry. 

In Appendix A, we show that the present approach pro- 
duces similar results to that based on UV colors span- 
ning the wavelength range I6OOA to 22OOA (but with 
smaller errors). For typical sources, the derived /3's agree 
to within A/3 ~0.1 which is about as well as we can de- 
termine P using UV colors (where possible systematics in 
/3 estimates are on the order of ~0.1). The present UV- 
continuum slope (3 estimates should therefore be directly 
comparable with others in the literature (Bouwens et al. 
2009; Meurer et al. 1999; Ouchi et al. 2004a; Stanway 
et al. 2005; Bouwens et al. 2006; Bouwens et al. 2010a; 
Finkelstein et al. 2010) where the wavelength baseline 
I6OOA to 22OOA was typical for /3 determinations. 

In deriving /3's for sources in our samples from the 
available photometry, we tested for significant deviations 
from a pure power-law shape. This is important for en- 
suring that the model we use to characterize the UV- 
continuum SEDs of sources is meaningful. For the aver- 
age source, we found that the observed photometry and 
the best-fit power-law SED differed by less than ~ 0.02 



TABLE 4 

The BIWEIGHT mean f/y-CONTINUUM SLOPE /3 
AND la SCATTER OF GALAXIES, AS A FUNCTION OF 
UV LUMINOSITY.''''' 



l/V-continuum slope /3 
< Muv,AB > Mean'' Icr Scatter'^ 
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^ The biweight mean t/V-continuum slopes /3 pre- 
sented here are also shown on Figure [G] 
^ The slopes presented here have been corrected for 
selection effects and measurement errors (see §3.4 
and Appendix B). The tabulated UV luminosities 
are the geometric mean of the measured luminosi- 
ties in the bands used to establish the UV slope (see 
Table [g. 

Both random and systematic errors are quoted 
(presented first and second, respectively). In §3.5, 
we provide a brief motivation for our estimates of 
the approximate systematic error in the biweight 
mean t/y-continuum slope /3. 

The la scatter presented here has been corrected 
for photometric scatter using the simulations de- 
scribed in Appendix B and therefore should reflect 
the intrinsic la scatter in the C/y-continuum slope 
/3 distribution. Typical uncertainties are ~0.1. 
° It is challenging to establish the intrinsic la scat- 
ter in the /3 distribution in this magnitude interval 
to <0.1 accuracy in cr^ either because of a large pho- 
tometric scatter in the individual /3 measurements 
or because of a limited number of sources. 

mag. Such deviations are not large enough to have a 
sizeable impact on our /3 determinations, shifting the ob- 
served /3 determinations by < 0.1. Such changes are well 
within the uncertainties we quote on our /? determina- 
tions (§3.5). 

3.4. Possible Selection and Measurement Biases 

To establish the actual distribution of f/F-continuum 
slopes /3 from the observations, we must account for the 
effect that object selection and measurement have on the 
observed distribution. These effects can cause the ob- 
served distribution to look very different from what it is 
in reality. Examples of such effects include (1) our LBG 
selection criteria preferentially including those galaxies 
in our samples with the bluest C/l^-continuum slopes and 
(2) photometric noise increasing the spread in the mea- 
sured [/F-continuum slopes /S. 
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Fig. 6. — Determinations of the f/V-continuum slope /3 distribution versus UV luminosity for star-forming galaxies at z ~ 4 {upper left), 
z ^ 5 (upper right), z ~ 6 {lower left), and 2 ~ 7 {lower right). The blue points show the t/V-continuum slope /3 determinations for 
individual sources in our samples. The large blue squares show the biweight mean l/V-continuum slope /3 determinations in each magnitude 
interval and error on the mean. We take the absolute magnitude of each galaxy in the UV to be equal to the mean absolute magnitude of 
that source in all the HST bands that contribute to its C/\/-continuum slope determinations (Table [Sjl. The blue solid line in each panel 
shows the best-fit relationship (see Table [5l and Figure [7] for the best-fit parameters). The z ~ 7 panel also includes a fit to the binned 
determinations but keeping the slope of the line fixed to the average dp / dMuy found for our z ~ 4, z ~ 5, and z ~ 6 samples (dashed 
line). The dotted black line in the z ~ 5, z ~ 6, and z ~ 7 panels show the best-fit relationship at z ~ 4 and is included for comparison. 
The vertical dotted line indicates the absolute UV magnitude for ij*_3 galaxies. For our z ~ 4 and z ~ 5 samples, the observed dispersion 
in (7y-continuum slopes /3 about the mean relation {blue line) is relatively small (see also Table HI and Figure [T5J. The intercept to the 
C/V-continuum slope /3 versus luminosity relationship appears to become slightly bluer towards higher redshift. However, the slope of the 
fyV-continuum slope f} versus luminosity docs not show any significant evolution as a function of redshift (see also Figure [Tj. 



To correct for these selection and measurement biases, 
we follow a very similar procedure to what we used in our 
first major study of the f/V^-continuum slope /3 distribu- 
tion at high redshift (Bouwens et al. 2009). We create 
mock catalogs of galaxies, generate artificial images for 
each source in our catalogs, add these sources to the real 
observations, and then reselect the sources and measure 
their properties in the same way as with the real obser- 
vations. We then determine the approximate biases that 
we would expect in the mean C/T^-continuum slope and 
Icr scatter due to source selection or photometric scatter. 
We compute these biases as a function of magnitude for 
each of our data sets. To ensure that we are able to de- 
termine the relevant biases to high precision, we repeat 
these experiments for >10^ artificial galaxies. 

Appendix B provides a detailed discussion of the sim- 
ulations we use to estimate the likely biases in the UV- 
continuum slope j3 distribution and establish the needed 
corrections. 



Figure[5]shows the results of these simulations for z ~ 7 
selections. The effect of biases related to the LBG selec- 
tion itself (Figure [^T|) and due to a possible coupling 
between the selection and measurement processes ( "pho- 
tometric error coupling bias") is shown explicitly. It is 
immediately clear from this figure that we can success- 
fully recover the input values of the ?7y-continuum slopes 
P and do not expect especially large biases in the derived 
slopes. We include similar panels for our z 4, z ~ 5, 
and z ~ 6 selections in Figure [22] of Appendix B. 

An important check to perform in assessing the overall 
quality of our C/T^-continuum slope determinations here 
is to compare the results from our deep data with the 
results from our wide data. Such a check is provided 
in Figure [25] of Appendix B, and it seems clear that the 
resuhs from the HUDF09 and ERS-^CANDELS data sets 
are in good agreement (A/3 < 0.1-0.2) over the luminosity 
range where they overlap and have good statistics. This 
suggests that the C/F-continuum slope (3 distributions we 
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recover are accurate and free of any sizeable biases. 

3.5. Establishing the j3 Distribution over a Wide 
Luminosity Baseline 

We can utilize the /7y-continuuni slopes /3 measured 
for individual sources (§3.3) to establish the distribution 
of [/^-continuum slopes /3 as a function of luminosity for 
each of our z 4-7 Lyman-Break Galaxy samples. 

To establish this distribution while properly account- 
ing for the relevant selection and measurement biases, 
we analyze the t/l^-continuum slope distribution for each 
data set and Lyman-Break sample separately. We deter- 
mine the mean [/^-continuum slope and Icr scatter. We 
correct the mean and scatter for the relevant biases (Ap- 
pendix B and §3.4). Typical corrections in the mean UV- 
continuum slope /3 are ;$0.1 in general; the corrections 
do however reach values of A/3 0.1 near the selection 
limit of our shallow and deep probes. 

When determining the mean and scatter for each dis- 
tribution, we use the biweight mean Cbi 



and biweight scale Sbi 

^ [SK|<i(^.-M)^(l-^f)f^' 

" IE|„.|<i(l-^^)(l-5z^?)l 

where m {xi - M)/(c MAD), MAD = median(|xi - 
M|), M is the median of the the data, n is the 

number of sources summed over, and c is the "tuning 
constant" (Beers et al. 1990). c is taken to be equal to 6 
in computing the biweight mean and 9 in computing the 
biweight scale. Use of robust statistics like the biweight 
mean and scale is valuable, given that a small fraction of 
the sources in our samples may be contaminated by light 
from nearby sources or may lie outside the target redshift 
range. Biweight mean determinations of /3 are similar 
to median determinations of j3 (median difference A/3 
~ 0.01 with ~0.04 scatter), but somewhat bluer (A/3 ^ 
0.1) than mean determinations (due to the /3 distribution 
having a somewhat extended tail towards red /3's and the 
biweight mean de- weighting the tail). 

For the absolute magnitude of each galaxy in the UV , 
we use the mean absolute magnitude of that source in 
all the HST bands that contribute to its f/F-continuum 
slope determinations (Table [3]). By using the geomet- 
ric mean for this luminosity, we aim to avoid giving too 
much weight to the bluer or redder bands in defining 
this luminosity and hence artificially creating a correla- 
tion between the derived t/l^-continuum slope /3 and UV 
luminosity. 

Figure [5] shows the t/F-continuum slope /? distribution 
versus the magnitude for our z 4, z 5, z ~ 6, and 
z ~ 7 samples. Scatter in the (5 distribution is relatively 
modest for our z ~ 4 and z ^ 5 samples, but is larger 
for our z ~ 6 and z ~ 7 samples. This is the result of 
the fact that our z 4-5 samples use a larger number 
of passbands and larger wavelength baseline to measure 
the t/T^-continuum slope /3. 

In Table IH we present our corrected determinations of 
the biweight mean J/y-continuum slope and la scatter 



as a function of UV luminosity, for our z 4, z 5, 
z ^ 6, and z 7 samples. We also include an estimate 
of the uncertainties in the biweight mean f/y-continuum 
slopes that result from the measurement uncertainties, 
intrinsic scatter in the /3 distribution, and small number 
statistics. 

For the systematic error on our [/^/-continuum slope 
measurements, we conservatively adopt values of 0.10- 
0.28. The dominant component of this error comes from 
uncertainties in our photometry. Allowing for small er- 
rors in the photometric zeropoints, aperture corrections, 
and PSF-matching, we estimate a maximum error of 0.05 
mag in our fiux measurements. This translates into pos- 
sible systematic errors of 0.10, 0.10, 0.14, and 0.28 in the 
P measurements at z ~ 4, z ~ 5, z ~ 6, and z ~ 7. 

We also allow for small systematic errors in our UV- 
continuum slope /3 measurements as a result of differ- 
ences in the wavelength baseline we use to derive /3 (Ta- 
ble [3]). These differences would have an effect on the /3's 
we derive from the available photometry, if the UV SEDs 
differed substantially from a pure power- law form (i.e., 
/^). The tests we perform in Appendix A suggest the rel- 
evant systematic errors are small A/3 < 0.13. Similar /3's 
are found from fits using the full J/F-continuum (§3.3 
and Figure |4]) as are found using a smaller wavelength 
range. 

3.6. Dependence of j3 on Luminosity 

We observe a clear trend in the C/T^-continuum slope /3 
as a function of UV luminosity in all four LBG samples, 
such that the [/l^-continuum slope /3 becomes progres- 
sively bluer at fainter luminosities (Figure |6] and [25]). 
Such trends in /3 had already been identified by Bouwens 
et al. (2009) in their analyses of z ~ 2.5 and z ~ 4 galaxy 
samples (see also Meurer et al. 1999; Labbe et al. 2007; 
Figure 8 of Overzier et al. 2008) and by Bouwens et al. 
(2010a) in their analyses of z ~ 5-7 galaxies (see also 
Wilkins et al. 2011). The observed luminosity depen- 
dence of /3 is thought to be due to a change in the dust 
content and perhaps the age of galaxies as a function of 
luminosity (Labbe et al. 2007; Bouwens et al. 2009). It is 
likely to be a manifestation of the well-established mass- 
metallicity relationship seen at a wide variety of redshifts 
(e.g., Tremonti et al. 2004; Erb et al. 2006a; Maiolino et 
al. 2008). 

The C/y-continuum slope /3 shows an approximately 
linear relationship on the magnitudes Mjjv of the sources 
(Figure ini), and therefore it makes sense for us to model 
this relationship using a first-order polynomial and de- 
termine the best- fit slopes d/S/dMiJv and intercepts 

/3muv = -19.5- 

In fitting a line to our t/y-continuum slopes, we use a 
finer binning (i.e., 0.5 mag) than shown in Figure |6] to 
minimize the impact of the binning scheme on the best- 
fit slopes and intercept. The results of our fits to the 
biweight means are shown as blue lines on Figure |6l and 
it is clear that the mean /3's are well fit by the linesO The 
fits exhibit a similar dependence on the UV magnitude 
Mjjv at each redshift. The intercept to the lines is also 
similar at all redshifts but appears to evolve monotoni- 

^ There may nonetheless be weak evidence in the results from our 
z ^ 4 and 2^5 samples that the dependence of /3 on luminosity 
is weaker faintward of —18 mag (see Figure [6]|. 
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TABLE 5 

Best-fit slopes and intercepts to the 
fv-continuum slope /3 to uv luminosity 
relationship (§3.6: see also figure o 



Dropout 


Mean 






Sample 


Redshift 




dp/dMuv 


S435 


3.8 


-2.00±0.02±0.10 


-O.llitO.Ol'' 


Veoe 


5.0 


-2.08±0.03±0.10 


-0.16±0.03=' 


1775 


5.9 


-2.20±0.05±0.14 


-0.15±0.04=' 


2850 


7.0 


-2.27±0.07±0.28 


-0.21±0.07 


U300'' 


2.5 


-1.70±0.07±0.15 


-0.20±0.04 



While the dfl/dMuv slopes for our 2: ~ 4, 5, and 6 sam- 
ples are quite similar overall, these slopes show even better 
agreement if we consider the correlations over the same lu- 
minosity range, i.e., excluding /3 determinations faintward 
of —17 AB mag. In this case, we find a dfS/dMi/y slope of 
— 0.13±0.02 for our 2: ~ 4 sample. 
^ From Bouwens et al. (2009). 

cally with cosmic time. The best-fit parameters - slope 
and intercept - we derive for these lines are presented 
in Table [5] and Figure [T) These determinations are in 
excellent agreement with previous work, as we discuss in 
§4. 

4. COMPARISON TO PREVIOUS RESULTS 

In the present section, we compare the present obser- 
vational results on the [/V^-continuum slope /3 with those 
previously obtained in the literature (Figure[8]). The goal 
is to assess the robustness of the present observational re- 
sults and to give some perspective on which trends are 
gaining widespread observational support. 

4.1. Comparison with Bouwens et al. 2009 (z ^ 3-6) 

Before the WFC3/IR camera on HST became op- 
erational, Bouwens et al. (2009) made use of the 
ACS+NICMOS observations to quantify the distribution 
of t/l/-continuum slopes /3 for star-forming galaxies over 
range in redshift {z ~ 2.5-6) and luminosities. Bouwens 
et al. (2009) derive /3 directly from the UV colors (e.g., 
as in Appendix A). How do the present [/T^-continuum 
slope determinations compare with those from Bouwens 
et al. (2009)? Both old and new results are shown in 
Figure |S1 Comparing the [/V^-continuum slope measure- 
ments made on the identical sources in the two data sets 
(old ACS-fNICMOS data vs. the new WFC3/IR obser- 
vations), we find reasonable agreement, with mean offsets 
l3wFC3/iR - Pacs+nicmos of Only -0.10, 0.04, 0.09 in 
the derived /3's at z ~ 4, z ~ 5, and z ^ respectively. 
The biweight mean [/^-continuum slopes (3 found here 
are also in good agreement with the slopes derived by 
Bouwens et al. (2009). At z 4 and z ~ 5, the present 
biweight mean /3's are just ^0.26 bluer and ~0.16 red- 
der than the values found in Bouwens et al. (2009); the 
present biweight mean /3's at z 6 show no average 
shift at all relative to the values found by Bouwens et al. 
(2009). A/3 ~ 0.1 of the differences result from our use 
of the more robust biweight means to express the central 
/3's, so the agreement is quite good overall. 

4.2. Comparison with Bouwens et al. 2010a (z ^ 5-7) 

Bouwens et al. (2010a) took advantage of the first-year 
WFC3/IR observations over the HUDF and the ERS 
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Fig. 7. — Slope and intercept of the L'^V-continuum slope- 
luminosity relationship as a function of redshift (§3.6: see also 
Table[5]l. (upper) Intercept of the (7V-continuum slope-luminosity 
relationship, as a function of redshift {large blue circles). Previous 
determinations of the intercept to the /J-luminosity relationship 
are also shown {open black circles: Bouwens et al. 2009). The 
shaded lavendar region represent the predictions from the cosmo- 
logical hydrodynamical simulations of Finlator et al. (2011: §5.1). 
We use —19.5 AB mag as the intercept, because of the substantial 
C/V-continuum slope measurements there at all redshifts. An ap- 
parent reddening of the (JV-continuum slope /3 at —19.5 AB mag 
(0.25L*_3) with cosmic time is observed, from z ~ 7 to 2 ~ 2.5. 
{lower) Slope of the (71^-continuum slope-luminosity relationship, 
as a function of redshift {large blue circles). Previous determina- 
tions of this dependence on luminosity at z ~ 1 — 3 (blue shaded 
region: Labbe et al. 2007) and at z ~ 2.5 and z ~ 4 {open black 
circles: Bouwens et al. 2009) are also shown. The shaded lavender 
region is as in the upper panel. A very similar dependence of f5 on 
UV luminosity is observed over the entire redshift range 2 ~ 7 to 

2 ~ 1. 

WFC3/IR observations to quantify the t/F-continuum 
slope /3 distribution to higher redshifts (z ~ 7) and 
lower luminosities. How do the present determinations of 
the [/P^- continuum slope /3 compare with Bouwens et al. 
(2010a)? At high luminosities, we find excellent agree- 
ment, with both studies preferring mean ?71^- continuum 
slopes /3 of —2 (see Figure |H]). At lower luminosities, 
however, we now find somewhat redder values of the UV- 
continuum slope /3, i.e., A/3 ~ 0.2 — 0.4 than found by 
Bouwens et al. (2010a). The observed differences in the 
mean C/y-continuum slope /3 seem to have resulted from 
both the small number of sources in previous samples 
and uncertainties in the photometry of faint sources in 
the early HUDF09 data. We remark that the current 
WFC3/IR observations of the HUDF from the HUDF09 
WFC3/IR program are approximately twice as deep as 
what Bouwens et al. (2010a) used. 

4.3. Comparison with the literature: Does /3 correlate 
with redshift? 

In the present analysis, we find that the t/l^- continuum 
slope /3 shows a correlation with the redshift of galaxies, 
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Fig. 8. — Comparison of the biweight mean {/V-continuum slopes 
/3 found here [solid blue circles) for luminous (L*/— 21 AB mag: 
top panel) and lower luminosity (0.1Z/*_g/— 18.5 AB mag: bot- 
tom panel) galaxies with values in the literature. Generally we 
only compare with the mean (yV-continuum slope determinations 
from the literature - since a biweight mean slope is not specified. 
However, as described in Figure Q the biweight mean values for /3 
tend to be somewhat bluer {A/3 ~ 0.1) than the mean values. In- 
cluded on this figure are (7\/-continuum slope determinations from 
Bouwens et al. (2010a: red open circles), Bouwens et al. (2009: 
red open squares), Bouwens et al. (2006: magenta triangle), Adel- 
berger & Steidel (2000: open black star), Ouchi et al. (2004a: open 
blue pentagon), Stanway et al. (2005: open cyan square), Hathi 
et al. (2008: open green squares), Dunlop et al. (2012: solid green 
squares), Wilkins et al. (2011: blue triangles), Castellano et al. 
(2012: solid black triangles), and Finkelstein et al. (2012: solid 
black squares). In general, we find good agreement with our pre- 
vious (7V-continuum slope determinations (Bouwens et al. 2009, 
2010a) although the current results are a little redder at lower 
luminosities. There is a moderate amount of scatter in the obser- 
vational results at lower luminosities (see §4). 



with higher redshift galaxies being bluer. This evolution 
is evident both in Figure [6] (compare the solid lines in 
the z^5, z^7 panels with the dotted lines) and 

in Figure [T] 

How does the correlation we find compare with other 
studies? A brief summary of the evidence for this red- 
shift dependence is provided in Figure El and there is a 
clear trend from bluer f/T^-continuum slopes /? at z ~ 6-7 
to redder slopes at z ~ 2-4. Essentially all studies of the 
[/^-continuum slope over the range z ~ 3-7 (Lehnert & 
Bremer 2003; Stanway et al. 2005; Bouwens et al. 2006; 
Bouwens et al. 2009; Bouwens et al. 2010a; Wilkins et 
al. 2011; Finkelstein et al. 2012) find evidence for this 
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Fig. 9. — The dependence of the ?7V-continuum slope /3 on UV 
luminosity Muy at z ~ 4-5, as determined by different analyses 
in the literature. The lines show the observed jS-Muv relations 
from Lee et al. 2011 {dashed black), Bouwens et al. 2009 {solid 
red), Wilkins et al. 2011 {dashed green), Ouchi et al. 2004 {dashed 
magenta), Finkelstein et al. 2012 {solid magenta), Overzier et al. 
2008 {dashed gray), Castellano et al. 2012 {solid green), Dunlop 
et al. 2012 {solid black), and the present work {blue). The blue 
squares are the /3 determinations at z ~ 4 from the present work. 
Error bars are Itr. Also included on this figure in parentheses is the 
slope of the f}-MuY relationship df} / dMuv determined in different 
analyses. In several cases (e.g., Ouchi et al. 2004; Wilkins et al. 
2011) shown here, we derived the plotted relations and dfi / dMuy 
slopes from the individual /3 determinations in those papers. The 
error bar in the lower left illustrates the approximate systematic 
uncertainties in previous /3 measurements. Similar luminosity de- 
pendencies are found at 2 ~ 1-3 by Labbe et al. (2007) and Bouwens 
et al. (2009: see FigureO. A large number of independent analyses 
have found evidence for a similar correlation between UV luminos- 
ity and (9 (see §4.4). Brighter galaxies are consistently found to be 
redder in their /3's and fainter galaxies are found to be bluer. 

evolution of j3 with redshift. The only exception to this 
is the recent study of Dunlop et al. (2012) who find no 
dependence (but Dunlop et al. 2012 do note that a com- 
parison of their results with those at 2; 3 does argue for 
evolution); we discuss the Dunlop et al. (2012) results in 
§4.5. 

4.4. Comparison with the literature: Does (3 correlate 
with UV luminosity? 

As we discuss in §3.6, we find that the [/l^-continuum 
slope (3 shows a clear correlation with the rest-frame UV 
luminosity of galaxies, with lower luminosity galaxies be- 
ing bluer at all redshifts. A good illustration of this cor- 
relation is provided in Figure [SI The best-fit relationship 
(solid blue line) shows almost exactly the same depen- 
dence for each high redshift sample. The uniformity of 
the slope and modest variation as a function of redshift 
is also clearly illustrated in Figure [71 The uniformity 
extends even to our 2; ~ 4 samples. While /? in these 
samples appear to show a somewhat weaker dependence 
on luminosity than our other samples, a slightly steeper 
dependence is found, i.e., dp/dMuy = —0.13 ± 0.02, if 
we use the same luminosity baseline as our 2; ~ 5 and 
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~ 6 samples (excluding sources faintward of —17 AB 
mag). 

In general, the correlation we find agrees very well with 
most previous studies. Figure [9] shows different (3 deter- 
minations as a function of luminosity at z ~ 4-5 (Ouchi 
et al. 2004; Overzier et al. 2008; Bouwens et al. 2009; Lee 
et al. 2011; Wilkins et al. 2011; Dunlop et al. 2012)13 
Despite some dispersion in the precise values of the /? 
measurements and some small variance in the best-fit 
slopes dp / dMjjv , nearly all published determinations of 
the [/T^-continuum slope /3 at z ^ 4-5 find bluer values 
for /3 at lower luminosities, with the same luminosity de- 
pendence. Two recent analyses that found minimal or 
no correlation of /3 with luminosity are those of Dunlop 
et al. (2012) at z ~ 5-7 and Finkelstein et al. (2012) at 
z ^ 4-5. We discuss the Dunlop et al. (2012) results in 
§4.5 and the Finkelstein et al. (2012) results in §4.7. 

Similar correlations with luminosity are found in the 
J/y-continuum slope results at z ~ 2-3 (Labbe et al. 
2007; Bouwens et al. 2009; Sawicki 2012) and in the UV- 
optical colors (Papovich et al. 2001; Labbe et al. 2007; 
Gonzalez et al. 2012). Again, two analyses did not find a 
correlation of /? with luminosity, and those are the Adel- 
berger & Steidel (2000) and Reddy et al. (2008) analyses 
at z ~ 2-3. In these cases, not only was the luminosity 
baseline too short to provide much leverage for quantify- 
ing this correlation, but also the luminosity range probed 
was around L* which is where the dispersion in dust 
properties relative to UV luminosity is at a maximum 
(e.g.. Figure 13 of Reddy et al. 2010). However, when 
one adds the Adelberger & Steidel (2000) samples to the 
faint (^26-27 mag) z ~ 2.5 samples observed by Bouwens 
et al. (2009), a strong correlation is present (Figure 3 of 
Bouwens et al. 2009: see also Figure 5 of Sawicki 2012). 
Taken together these results indicate that there is also a 
clear trend with luminosity at z ^ 2-3. 

4.5. Comparison with Dunlop et al. 2012 (z ^ 5-7) 

Dunlop et al. (2012) use the WFC3/IR observations 
over the IIUDF09 and ERS observations to quantify the 
f/y-continuum slope distribution at z ~ 5-7. They select 
sources using a photometric redshift procedure and then 
measure their C/y-continuum slopes /3 from the UV col- 
ors. In both respects, their procedure differs from that 
followed here (§3.2 and §3.3). Overall, the individual 
J/y-continuum slope /3 measurements of Dunlop et al. 
(2012) are in reasonable agreement with the present re- 
sults (see Figure [8]). The mean /3 they derive for bright 
z ~ 5 galaxies is A/3 ~ 0.3 bluer and the mean (3 they 
derive for faint z ~ 6 galaxies is A/3 ^ 0.4 redder. Given 
the quoted uncertainties on the measurements, the dif- 
ferences are not particularly significant. 

The main differences arise when we look at the trends 
in the C/y-continuum slope /3 with redshift and lumi- 

The slopes we derive from Labbe et al. (2007) are based 
upon their results at 1700Aand 3600Ain their Figure 3. Previ- 
ously, Bouwens et al. (2009) had estimated a slope of ^ —0.25 
in the (/V-continuum slope vs. Mjjv relationship at z ~ 1-2.7 
from the Labbe et al. (2007) results based on a shorter 1700Aand 
2200A wavelength baseline (see Figure [7|- The reason we use a 
more extended wavelength baseline to estimate /3 than Bouwens 
et al. (2009) had used is to allow for a fair comparison with the 
present /3 results (which use an extended wavelength baseline to 
estimate f): see §3.3). 



nosity. Dunlop et al. (2012) find a mean [/l^-continuum 
slope /3 of galaxies equal to ^ —2.1, with no significant 
dependence on luminosity or redshift. This is in contrast 
to the strong correlation we find of /3 with both lumi- 
nosity (Figure [6] and [71) and redshift (Figure [7]). Their 
results are also inconsistent with the trends reported in 
the literature (§4.3-§4.4; Figure [7] and il) . 

We have attempted to understand the source of this 
difference both by a qualitative assessment of some is- 
sues that we know are important for obtaining reliable re- 
sults and by a quantitative assessment using simulations 
(see Appendix D and §4.6). First, we remark that Dun- 
lop et al. (2012) make no attempt to correct their mean 
/3's for the fact that galaxies with bluer ?7T^-continuum 
slopes (3 are easier to select than galaxies with redder 
J/y-continuum slopes fi. This effectively biases their 
[/^-continuum slopes /3 to bluer values. It is a fairly 
straightforward process to correct for this issue (e.g., we 
describe such a correction in Appendix B.1.1 and Fig- 
ure [5]where we show its effect) FT Second. Dunlop et al. 
(2012) use an overlapping set of information both to se- 
lect sources and to measure the J/y-continuum slopes /3. 
This biases their results (see Appendix D), though the 
magnitude of this bias is mitigated by their consideration 
of only the brightest sources. 

Third, one further limitation of the Dunlop et al. 
(2012) analysis is their exclusion of the lowest luminosity 
sources. This significantly reduces the leverage they have 
to quantify trends in the mean [/l/-continuum slope as a 
function of luminosity. Dunlop et al. (2012) exclude faint 
sources, because their simulations suggested to them that 
/3 could not be measured in an unbiased way. However, 
as we show in Appendix B and D, we are able to obtain 
reliable measurements (see also §4.6). Not only can we 
recover the mean [/^-continuum slope for our samples 
to very faint magnitudes with excellent accuracy, but we 
can recover these slopes for samples selected using a pho- 
tometric redshift procedure, if the information used for 
source selection is clearly separated from that used to 
measure /3 (see Figure [TOll. 

As mentioned above, these three issues led us to con- 
sider a more quantitative evaluation of the Dunlop et 
al. (2012) procedure so that we could better understand 
what was happening with their measurement of /?. This 
is discussed in more detail in the next section. 

4.6. Can /3 be measured in a largely bias-free way for 
low S/N sources? 

As discussed above, a key question that has arisen in 
recent papers is whether it is possible to determine the 
J/y-continuum slope /3 distribution to very low luminosi- 
ties with small biases. Dunlop et al. (2012), in partic- 
ular, have suggested that it is not possible and have 
supported this suggestion with a series of simulations 
where they add noise to model galaxies and reselect these 
galaxies with their photometric redshift code. Dunlop et 
al. (2012) argue that noise in the photometry combined 
with a preference for selecting sources with blue colors 
would result in highly biased estimates for the mean UV- 
continuum slope (3. Dunlop et al. (2012) show that such 
a bias towards bluer J/y-continuum slopes at faint mag- 
nitudes is present in their mock data sets. Dunlop et 

" See also Figure 5 of Wilkins et al. 2011. 
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Fig. 10. — Comparison of the recovered f/V-continuum slopes j3 
versus near-IR magnitude for a z ~ 7 galaxy population with in- 
put (7V-continuum slopes /3 of —2.0 and —2.5 (see Appendix B for 
details). Shown are the results from the present Lyman- break selec- 
tion (solid blue lines), from a 6-band photometric redshift selection 
(dashed red lines), and from the Dunlop et al. (2012: Dll) "ro- 
bust" photometric redshift selections (solid red circles: from their 
Figure 8). The reason our measured f/V-continuum slope measure- 
ments /3 are not significantly biased towards fainter magnitudes is 
that we select samples using a different part of the rest-frame UV 
SED (bluest two band passes redward of the break) than we use 
to measure the (7V-continuum slope (second bluest bandpass and 
redder). See Figure|3] The situation is similar if one uses a 6-band 
photometric redshift selection (Appendix D). By contrast, Dun- 
lop et al. (2012) use overlapping information both to select their 
sources and to measure the (7V-continuum slopes. Therefore, while 
photometric scatter in the bands we use for selection has little effect 
on our f/y-continuum slope measurements, these steps are tightly 
coupled in the photometric redshift approach utilized by Dunlop 
et al. (2012). This results in the strong biases shown in this figure. 
While Dunlop et al. (2012) try to minimize the magnitude of these 
biases by restricting their analysis to the highest S/N sources, the 
same inherent biases in their /3 estimates will remain, but at a lower 
level. See §4.6, Appendix B.1.2, and Appendix D. 



al. (2012) argue that similar biases are likely present in 
Lyman-break selections, without further substantiating 
this claim. 

We agree that selection biases can affect the distri- 
bution of J/y-continuum slopes /3. However, the size 
of these selection biases is extremely dependent upon 
how one selects the galaxies and measures their UV- 
continuum slopes /3. As we show in Figure 1101 our 
Lyman-Break selections yield much smaller selection bi- 



ases overall. The reason we expect biases in our selec- 
tions to be small is that we select galaxies using a dif- 
ferent part of the rest-frame UV SED (the bluest two 
passbands redward of the break) than we use to measure 
[/V^-continuum slopes (the second bluest passband and 
redder). As a result, photometric scatter in the bands 
we use to measure the [/^-continuum slope /3 is largely 
independent of similar scatter in the bands we use for 
selection. Therefore, we would not expect our /? mea- 
surements to be significantly biased as a result of the 
object selection processFl 

Achieving similarly small biases to faint magnitudes 
with a photometric redshift technique is also possible. 
However, one must again be careful to use different in- 
formation to select sources from what one uses to mea- 
sure the ?7y-continuum slope /3. To illustrate this, we 
consider the situation at z ~ 7 in the current HST ACS 
+ WFC3/IR data set. We have run simulations where 
we attempt to measure the mean [/T^-continuum slope /3 
for a set of z ~ 7 galaxies selected using a photometric 
redshift procedure. Both the simulations and results are 
discussed in Appendix D. We consider (1) the case where 
5 HST bands are used to select sources and determine 
redshifts (this excludes those bands used to measure /?), 
(2) the case where 6 bands are used (and so now includ- 
ing one of the two bands used to measure /?), and (3) 
the case where all 7 HST bands are used for selection 
and redshift determination (and so both bands used to 
measure /3 are also included to measure redshifts and se- 
lect the sources). This latter approach is basically what 
Dunlop et al. (2012) do. 

The results are shown in Figure [211 of Appendix D. 
While (3 measurements show substantial biases when all 
7 bands are used for the photometric redshift estimates 
(similar to the procedure of Dunlop et al. 2012) (3 mea- 
surements made using 5 or 6 bands show much smaller 
biases. This demonstrates that the J/y-continuum slope 
f3 can be measured with very small biases to faint mag- 
nitudes. 

4.7. Comparison with Finkelstein et al. 2012 (z ^ A-7) 

In an independent analysis, Finkelstein et al. (2012) 
also use the recent WFC3/IR observations over the 
HUDF and CDF-South GOODS field to quantify the 
C/T^-continuum slope distribution for star-forming galax- 
ies at z ~ 4-8. Finkelstein et al. (2012) split sources into 
five different redshift samples using a photometric red- 
shift procedure and then estimate /3 by finding the model 
SED which best fits the photometry of a source and de- 
riving (3 from this model. Finkelstein et al. (2012) find 
that /3 shows a clear dependence on redshift, but report 
only a limited dependence on the UV luminosity. 

/3 vs. Luminosity Trends: While the redshift dependence 
Finkelstein et al. (2012) find for f3 is in excellent agree- 
ment with what we find (compare the solid black squares 
and large blue circles in the upper panel of Figure [8]) , 
the luminosity dependence Finkelstein et al. (2012) ob- 
serve would appear to be considerably weaker. After all, 
Finkelstein et al. (2012) report no significant correlation 

This issue is of course in addition to the normal selection 
biases that Lyman-Break samples show against sources with red 
C/V-continuum slopes /3, but as we show in Appendix B.1.1 these 
biases are small for all but the reddest /3's. 
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Fig. 11. — (upper panel) The approximate median UV- 
continuum slopes /3 (large red circles) derived by Finkeistein et al. 
(2012) for their sample as a whole (based primarily on the CAN- 
DELS+ERS programs) and two fainter subsamples from the HUDF 
(-20 < Muv < -18 and -18 < Muv < -16). The median /3's 
we calculate for these two fainter subsamples and the bootstrap 



uncertainties on these medians, 
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respectively, are based on the /3 measurements plotted in Figure 5 
of their paper (replicated here as the small red points). The solid 
magenta line shows the trend in /3 Finkeistein et al. (2012) report 
in their baseline analysis. The dashed red line shows the trend 
we find comparing the median /3's Finkeistein et al. (2012) mea- 
sure for their sample as a whole (large red circle at ^ —20 mag) 
with the median /3's we calculate for their two fainter subsamples 
in the HUDF (large red circles at ~ —19 mag and ~ — 17 mag). 
The solid blue squares and lines are our own /3 determinations and 
are as shown in Figure [6] While Finkeistein et al. (2012) report 
no correlation between /3 and luminosity in their baseline analysis 
{magenta line), we observe quite a strong correlation with lumi- 
nosity, making exclusive use of their measurements of /3 for fainter 
sources in the HUDF to define the trend to lower luminosities {red 
line). The trend dfi / dMuv we find making exclusive use of their 
/3 measurements for the fainter HUDF sources, i.e,, — 0.06±0.02, 
is in much better agreement with what we find, i.e., — O.lliO.Ol 
than it is in their baseline analysis. The final version of Finkei- 
stein et al. (2012) also reports this same trend making exclusive 
use of the fainter sources from the HUDF. The median /3's Finkei- 
stein et al. (2012) measure for sources in the two fainter-magnitude 
HUDF subsamples shown here {two large red circles) are also in 
very good agreement with our own measurements {blue squares), 
particularly at ~ —19 mag. (lower panel) Similar to the upper 
panel, but comparing results from the z ~ 5 sample of Finkeistein 
et al. (2012) with our own results (see §4.7). The two fainter sub- 
samples of z ~ 5 galaxies from the HUDF are over the magnitude 
ranges -20.3 < Muv < -18.3 and -18.3 < Muv < -16.3. As in 
the upper panel, we note better agreement with the Finkeistein et 
al. (2012) /3 measurements, if we restrict our comparison to their 
/3 measurements from the HUDF. 



of (3 with UV luminosity in their basehne analyses of 
their five redshift samples - seemingly different than the 
clear correlation of /3 with luminosity we report. 

Despite these apparent differences, the overall results 
from the two studies are actually in fairly good agree- 
ment. For example, with regard to the z ^ 6 and z ~ 7 
samples, the best-fit dP/dMjjv values Finkeistein et al. 
(2012) find, i.e., -0.10 ± 0.07 and -0.20 ± 0.11, respec- 
tively, are strikingly similar to the values we find, i.e., 
-0.15 ± 0.04 and -0.21 ± 0.07, respectively. 

For the z ^ 4 and z ~ 5 samples, Finkeistein et al. 
(2012) do not report a significant correlation between 
/3 and UV luminosity in what they identify as their 
basehne analysis (finding d[3/dMuv values of 0.01±0.03 
and 0.00±0.06, respectively). However, in this analysis, 
Finkeistein et al. (2012) only make use of a 2-mag base- 
line in luminosity (due to their binning scheme), with 
their low luminosity anchor point largely coming from 
the relatively shallow ~1.6 orbit CANDELS data (with 
only a small contribution from the HUDF data). The 
CANDELS observations are clearly poorly suited to de- 
termine the trend in /3 to very low luminosities, given the 
very low S/N's and potentially large biases expected for 
the faintest sources in the CANDELS fields. We might 
expect the situation to change taking advantage of the 
additional leverage in luminosity provided by the (3 mea- 
surements they provide for faint sources in the ultra-deep 
HUDF observations. 

We can check this by extracting the HUDF measure- 
ments from Figure 5 of their paper. By comparing 
their median (3 measurements for brighter galaxies with 
their median /3 measurements for fainter galaxies in the 
HUDF, we find evidence for a significant correlation with 
luminosity. We find dp/dMuv trends of -0.06±0.02 
and — 0.13±0.04, respectively. In the final version of 
their paper, Finkeistein et al. (2012) also note a simi- 
lar correlation with luminosity making use of the faintest 
HUDF sources, finding d(3/dMuv trends of -0.07±0.01 
and — 0.09±0.03, respectively. While not in exact agree- 
ment with the trends we derive based on our own l3 
measurements, i.e., — O.lliO.Ol and — 0.16±0.03, respec- 
tively, the agreement is much better. Use of the faint 
sources in the HUDF is important to take full advantage 
of the available leverage in luminosity to quantify the l3 
vs. Mjjv trend. 

In Figure [Til we show the /3 vs. Muv trend that 
Finkeistein et al. (2012) find in their baseline z ^ A 
and z ^ 5 analyses {magenta lines) and the trend we 
find from their measurements making exclusive use of 
sources from the HUDF to constrain (3 to fainter magni- 
tudes (dashed red lines). In addition, we show the me- 
dian /3's Finkeistein et al. (2012) find for two fainter z ^ 4 
and z ^ 5 subsamples within the HUDF (large solid red 
circles: we can extract /3 measurements for individual 
sources within the HUDF from their Figure 5) and the 
bootstrap uncertainties on these medians. In both sam- 
ples there is a clear trend in the median /3's towards bluer 
values at the lowest luminosities. It is striking how well 
the median /3's Finkeistein et al. (2012) derive from the 
HUDF agree with our own /3 measurements, particularly 
in the luminosity interval [—20 mag, —18 mag]. While 
it is true that the median /3's Finkeistein et al. (2012) 
derive for their entire ERS-^CANDELS-f HUDF09 sam- 
ple are redder in general than what we find at these lu- 
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minosities, these median /3's receive their largest weight 
from the shallower ERS+CANDELS samples and there- 
fore may be subject to possible selection, measurement, 
or contamination biases (see the discussion at the end of 
this section). 

Expected Trend in Luminosity: Finkelstein et al. (2012) 
defend the weak correlation of (3 they report versus lu- 
minosity (particularly as derived in their baseline anal- 
ysis), arguing that (3 should show a stronger correlation 
with stellar mass than UV luminosity. We do not dis- 
pute this assertion; however, it would be most surprising 
if a correlation of /3 with stellar mass did not also ap- 
pear as a correlation with UV luminosity. Given the 
correlation found between SFR and stellar mass in high- 
redshift galaxies (e.g.. Stark et al. 2009; Gonzalez et al. 
2011; McLure et al. 2011; Lee et al. 2012; Reddy et 
al. 2012b), we would expect the UV luminosity to be 
broadly correlated with stellar mass. A similar conclu- 
sion can be drawn from high-redshift angular correlation 
function results. Higher luminosity galaxies are consis- 
tently found to be more clustered than lower luminosity 
galaxies (e.g., Ouchi et al. 2004b; Lee et al. 2006). Such 
would not be the case if UV luminosity was not corre- 
lated with mass (in this case halo mass). Independent 
of these considerations, we remark that /3 also shows a 
clear correlation with UV luminosity in various cosmo- 
logical hydrodynamical simulations (e.g., Finlator et al. 
2011; Dayal & Ferrara 2012), and the predicted trends 
(e.g., d/3/dMijv ~ —0.10 is expected in the Finlator et 
al. 2011 simulations) are comparable to what we find 
(Figure [Z]). 

Possible Biases in the Finkelstein et al. measurements: 
Finkelstein et al. (2012) have suggested that the P-Mjjv 
trends we find may be stronger than what they find due 
to the fact that we measure the UV luminosity at a differ- 
ent rest-frame wavelength than they do, and /3 vs. Muv 
trends may depend on this rest-frame wavelength. For 
our z ~ 4 samples, for example, we measure the rest- 
frame UV luminosity at 204lA(see Table[3]) while Finkel- 
stein et al. (2012) measure it at 1500A. We are in full 
agreement that dp/dMjjv will depend on the rest- frame 
wavelength where the UV luminosity is derived. How- 
ever, the results from Figure 3 in Labbe et al. (2007) 
suggest one would find an even stronger /3 vs. dMuv 
trend at bluer wavelengths than one would find at red- 
der wavelengths, which is different from what Finkelstein 
et al. (2012) find. It is therefore not clear this explains 
the differences. 

Instead, one might be concerned that the (3 vs. dMjjv 
trend Finkelstein et al. (2012) find may be biased as 
a result of the rest-frame wavelength Finkelstein et al. 
(2012) use to measure the Muv luminosity^ By de- 
termining luminosity at the blue end (at 1500 A) of the 
wavelength baseline they use to derive /3, Finkelstein et 
al. (2012) effectively introduce a coupling between the 
errors that affect both their /3 measurements and their 
determinations of the UV luminosity Mjjv- This could 

^'^ Finkelstein et al. (2012) also discuss this issue at some length 
in their paper (as a source of differences between the /3 vs. Mjjy 
trends we find) and would appear to find a similar effect, but given 
its importance for understanding differences between our results, 
we feel this discussion is worth repeating. 



be problematic since any errors in the fiux measure- 
ments of sources would cause sources to be either fainter 
and redder or brighter and bluer, causing the dj3/dMuv 
trend derived by Finkelstein et al. (2012) to be biased 
towards too high of values. Repeating the determination 
of dj3 / dMjjv at z ~ 4-5 based on our own fiux measure- 
ments but basing Muy on the flux measurement at the 
blue end of the wavelength baseline to derive /3 and us- 
ing only the wide-area CANDELS-KERS sources, we es- 
timate that this could bias the derived d(3/dMuv trend 
too high by A{dl3/dMuv) ~ 0.05. This bias is analo- 
gous to the photometric error coupling bias we discuss in 
Appendix D. However, instead of the coupling being be- 
tween source selection and the /3 measurements, it is be- 
tween the measurement of /3 and the measurement of the 
UV luminosity Af[/vE3 Contamination in the shallower 
CANDELS-HERS samples (from lower redshift galaxies) 
could also be an issue for Finkelstein et al. (2012) in de- 
riving the trend in /3 to lower luminosities Muy. 

Given the much smaller fiux uncertainties for z ~ 4-5 
sources in the HUDF (and smaller contamination rates) , 
we would expect the /3's Finkelstein et al. (2012) mea- 
sure there to show significantly smaller biases than /3's 
measured for sources in ERS+CANDELS fields at the 
same luminosities. Encouragingly enough, the median 
/3's Finkelstein et al. (2012) derive from the HUDF are 
in good agreement with our own, particularly in the lu- 
minosity interval [—20 mag, —18 mag] (compare the blue 
squares and large red circles in both the upper and lower 
panels of Fiaure lll^ . 

Finally, we remark Finkelstein et al. (2012) use the 
same photometry both to select sources and measure /3. 
Given the argumentation in the previous section and Ap- 
pendix D (see also Dunlop et al. 2012), we might expect 
the bias in the derived /3's to be non-zero. However, in 
practice, given the large number of passbands used to se- 
lect sources and measure their redshifts, the bias is likely 
to be quite small except at z ^ 7 (similar to our study: 
but see §4.8). Finkelstein et al. 2012 are aware of this 
issue and explicitly discuss it in their paper. 

4.8. How blue are lower luminosity galaxies at z 7? 

Galaxies with the most extreme UV properties are ex- 
pected to lie at very high redshift and have low lumi- 
nosities - given the early cosmic times in which they are 
observed and likely low masses. It has therefore been 
of considerable interest to establish the f/F- continuum 
slopes /3 for the faintest observable z ^ 7-8 galaxies. 
Early observations of such galaxies in the HUDF gave 
tantalizingly steep values of the fJT^-continuum slope /3, 
i.e., /3 ^ —3. 

In the present study, we find a mean [/l^-continuum 
slope /3 of —2.7 ± 0.2 for faint z ^ 7 galaxies. This 
is slightly redder than the mean C/t^-continuum slope /3 
(—3.0 ± 0.24) found in our earlier study of /3 for z ^ 7 
galaxies in the HUDF. It is also slightly redder than that 
(-3.0 ± 0.5) found by Finkelstein et al. (2010) using the 

Note that this same issue does not substantially bias the /3 vs. 
Mjjy trends we find because we take the UV luminosity to be the 
geometric mean of the luminosity measurements that contribute to 
our (9 measurements (§3.4). Flux measurements on either end of 
the baseline used to derive /3 bias the vs. Muv trend in opposite 
directions and should largely cancel. 
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same dataI3 Dunlop et al. (2012) have argued that it 
is not possible to estimate the [/T^-continuum slope /? 
at such low luminosities, but as we discuss in §4.6, such 
measurements are possible if care is taken to minimize 
biases by ensuring the information used for selection is 
independent of that used to measure /3. 

How robust are our measurements of the mean UV- 
continuum slope /? for faint z ~ 7 galaxies? While our 
simulations suggest the biases are not large, it is use- 
ful to check this result by obtaining an independent es- 
timate of the mean (3. For this estimate, we use two 
completely independent data sets to select sources and 
to measure (3. Source selection is done using the first- 
year WFC3/IR observations over the HUDF (18 orbits 
Yi05, 16 orbits J125, 28 orbits ifieo) from the HUDF09 
program while the measurement of /3 is done using the 
second- year WFC3/IR observations over the HUDF (18- 
orbit J125, 25-orbit i/ieo data). Since the second- year 
data were not used to select the sample, we can make an 
unbiased measurement of the [/^-continuum slope /3 for 
this sample using the new observations^ For compari- 
son with the Bouwens et al. (2010a) study on (3, we use 
the same z 7 sample. The biweight mean /3 we derive 
for our lowest luminosity (Mjjv.ab ^ —18.5) subsam- 
ple is — 2.8 ± 0.2. While this is slightly redder than the 
(3 = — 3.0±0.2 we find from the first-year observations 
(Bouwens et al. 2010a), this completely independent and 
unbiased estimate does suggest the C/V^- continuum slopes 
/3 for lower luminosity galaxies at 2; ~ 7 are very blue. 

As one final check on the mean /3 for faint z ^ 7 galax- 
ies, we considered one variation on the previous test. 
We divided the Ji25-band data for each HUDF09 field 
(HUDF09, HUDF09-1, HUDF09-2) into two disjoint sub- 
sets and produced separate Ji25-band reductions from 
each. The Yios-band data and first third of the Ji25-band 
data were used for the selection of z 7 sources, and 
the iJi6o-band data and final two-thirds of the Ji25-band 
data were used to measure the t/F-continuum slopes (3. 
As in the previous test, this is to ensure that the informa- 
tion used for source selection is completely independent 
of that used for the /3 measurements, and therefore the 
photometric error coupling bias (discussed in §4.6 and 
Appendix B.1.2) must be zero. The biweight mean (3 we 
derive for the faintest z ~ 7 sample {Muvab ^ —18.3) 
based on the three HUDF09 fields (HUDFb9, HUDF09- 
1, and HUDF09-2) is -2.7 ± 0.2, again consistent with 
our other estimates. 

This new determination of the mean ?71^-continuum 
slope /? for very low luminosity z ^ 7 galaxies is very 
blue (i.e., /3 ^ —2.7 ± 0.2). However, this does not ap- 
pear to be especially anomalous. In fact, it appears to 
be consistent with what one might expect extrapolating 
the z ~ 4-6 C/T^-continuum slope /3 relationship to z 7 
(see e.g. dashed line in the z '--^ 7 panel to FigureH]). The 

Despite an apparent difference in the quoted uncertainties 
on the moan /3, the Bouwens et al. (2010a) and Finkelstein et 
al. (2010) error estimates are actually quite similar, if considered 
over the same magnitude interval and using similarly-sized sam- 
ples. Finkelstein et al. (2010) consider a sample that is half as 
small and 0.25 mag fainter — which not surprisingly results in a 
larger quoted uncertainty for /3. 

We note that this measurement could still be affected (at the 
level of A/3 ~ 0.1) by the selection volume bias (Appendix B.1.1) 
even though the photometric error coupling bias (Appendix B.1.2) 
will be zero. 
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Fig. 12. — Biweight mean t/V-continuum slope /3 versus the ab- 
solute magnitude in the rest-frame UV. The mean (/V-continuum 
slope results at 2 ~ 4 (blue), 2 ~ 5 {green), 2 ~ 6 {red), and 
2 ~ 7 {black) from Figure |6] are summarized here and compared 
with the (7y-continuum slopes (5 expected by the Finlator et al. 
(2011) cosmological hydrodynamical simulations at 2 ~ 6-7 {solid 
purple line). The top axis gives the median stellar mass Gonzalez 
et al. (2011) found for galaxies at a given rest-frame UV luminos- 
ity. Outside of the range —21 < Mi;v,AB < —18.5, these stellar 
masses are an extrapolation of the trends found by Gonzalez et 
al. (2011). The mean slopes /3 found at 2 ~ 4, 2 ~ 5, and 2 ~ 6 
are offset to bluer values by a uniform A/3 ~ 0.1-0.3 offset to show 
how similar the dependence of /3 on luminosity is at redshifts 2 ~ 4, 
2 ~ 5, and z ~ 6 where the l/V-continuum slopes are the most well 
defined. The existence of some change (or offset) in /3 versus red- 
shift is plausible - as both the age and dust properties of galaxies 
could easily be a function of cosmic time. The dependence of /3 
on luminosity at 2 ~ 7 appears to be stronger, but is consistent 
with the other redshifts at Icr. The Finlator et al. (2011) results are 
shifted A/3 ^ 0.10 bluer to better illustrate their similarity with the 
observed trends. Tableland Figurc[7]provide our best-fit determi- 
nations of how the (yV-continuum slope /3 depends on luminosity 
at 2 ~ 4-7. The similar luminosity dependencies strongly argue 
that the observed luminosity dependence is real and indicative of 
a sequence in star-forming galaxies at 2 ~ 4-7 (§5.1). 

observed correlations of /3 with both redshift and lumi- 
nosity are such that one would expect faint z ~ 7 sources 
to be very blue. This blue P is also not inconsistent with 
what one can achieve with standard stellar population 
modeling (where the J/l/- continuum slope (3 can become 
as steep as ^—2.7: Schaerer 2003; Bouwens et al. 2010a; 
Robertson et al. 2010). The observed C/y-continuum 
slopes /3 at z 7 therefore seem to provide no particu- 
larly compelling evidence for exotic stellar populations, 
i.e., very low metallicity stellar populations or a high es- 
cape fraction (see also Finkelstein et al. 2012). Bouwens 
et al. (2010a) briefly speculated as to what such blue /3's 
might imply, if future observations conflrmed that /3 was 
really as blue as —3 with small uncertainties. 

5. DISCUSSION 

5.1. Sequence in SF Galaxies at High Redshift 

In the previous section, we presented evidence that the 
C/y-continuum slopes (3 of star-forming galaxies at high 
redshift were distributed along a well-defined sequence 
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in UV luminosity. The intrinsic scatter in /3 along the 
sequence is small {ap ^ 0.34), and the dependence of 
13 is such that galaxies become bluer towards lower lu- 
minosities. Such a sequence is particularly prominent in 
our z ^ 4 sample, but all of our higher redshift selections 
(z 4, z ^ 5, z ~ 6, z 7) show strong evidence for 
such a sequence as well (Figure H]) . 

The fact that we observe the same dependence of /3 on 
UV luminosity in each of four Lyman-Break selections 
suggests the trend we are recovering from the observa- 
tions is real. Such a color-magnitude relationship was 
already evident in many studies of galaxies over the red- 
shift range z ^ 2-5 - though the clearest evidence was 
presented by Bouwens et al. (2009) for z ~ 2.5 and z ~ 4 
samples and Labbe et al. (2007) for z ~ 1.05, z ^ 1.8, 
and z ~ 2.7 samples. See §4.4. The present work con- 
firms these trends and extends them to z ^ 5, z 6, and 
z ~ 7. 

Consistent with these trends, the C/y-continuum 
slopes /3 we measure for the most luminous (and presum- 
ably most massive) galaxies have similarly red values for 
P of —1.6 to —1.9 in all four redshift samples examined 
here (Table |4j similar values were also found by Lee et al. 
2011 and Willott et al. 2012) while the lowest luminosity 
galaxies probed have relatively similarly blue values for 
(3 of —2.2 to —2.7 for all four samples. 

The existence of well-defined sequences for both star- 
forming and evolved galaxies at lower redshift is now very 
well established. In the local universe (z ^ 0.1), for ex- 
ample, Salim et al. (2007) find that galaxies fall along a 
well-defined sequence in SFR versus stellar mass. Sim- 
ilar star-forming sequences were found by Noeske et al. 
(2007) and Martin et al. (2007) at somewhat higher red- 
shifts, from z ^ 0.2 to z ^ 1 (with a 0.3 dex scatter in 
the SFRs). Daddi et al. (2007) find evidence for such 
a sequence in star-forming galaxies at z ~ 2, and El- 
baz et al. (2011) show that such a sequence exists for 
even more luminous systems from recent Herschel obser- 
vations. While the present color-magnitude sequence we 
observe is not a SFR vs. stellar mass sequence, the exis- 
tence of such a sequence suggests that galaxies build up 
in a relatively well-defined way, versus cosmic time. 

Theoretically, we would expect such a sequence due to 
the build-up in metals and dust anticipated to occur as 
galaxies grow in luminosity and mass. One useful illus- 
tration of this can be seen in some recent work by Dave 
et al. (2006) and Finlator et al. (2011) who use smooth 
particle hydrodynamics to model the evolution of galax- 
ies to z ~ 6. In Figure 6 of Dave et al. (2006), for 
example, we see a clear mass-dependence in the metal- 
licity of galaxies, with ^^0.3 dex change in metallicity per 
^1 dex change in mass. 

Figure 7 of Finlator et al. (2011) shows the expected 
trends in J/y-continuum slopes /3 as a function of lu- 
minosity, including the effects of starlight, dust, and 
emission lines. Finlator et al. (2011) predict mean UV- 
continumn slopes of ~— 2.02 for luminous (Muv^ab ^ 
—20.5) z ^ 7 galaxies and ~— 2.28 for lower luminosity 
[Muv,AB ^ —18.5) z: ~ 7 galaxies - equivalent to an 
approximate slope to the /3 - Mjjv relationship of just 
'^— 0.13. A fit to the /3 vs. Muv relationship for all 
the z ^ 4, z ^ 5, z 6, and z 7 sources in the 
Finlator et al. (2011) simulations yield d/3/dMiJv slopes 
of 0.10, 0.08, 0.13, and 0.09, respectively 
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Fig. 13. — Sensitivity of the (7V-continuum slope (5 (wavelength 
baseline I6OOA to 2700A) to changes in the mean metallicity, 
age, or dust extinction of a galaxy population. For our fiducial 
model (where /3 ^ —1.5), we assume t = 70 Myr, r = 10 Myr, 
[Z/Zq] = -0.7, E(B -V) = 0.15, and a Salpeter IMF (where 
the star formation history is parametrized as e~'/'^) from the Pa- 
povich et al. (2001) fits to 2 ~ 2.5 i7-dropouts from the WFPC2 
HDF North. In modifying our fiducial model to have younger ages, 
we make changes to both t and r. Factor of 2 (0.3 dex) changes 
in the mean metallicity, age, or dust content of galaxies result in 
0.07, 0.15, 0.35 changes in the C/y-continuum slope /3. Similar to 
Figure 7 from Bouwens et al. (2009) but for l/V-continuum slopes 
derived over a wider wavelength baseline. This wavelength baseline 
is appropriate given our procedure for determining /3 using flux in- 
formation over a wide wavelength baseline (§3.3; Figure[4l TablejS]!. 
It seems clear that changes in the mean dust content of galaxies at 
high-redshift likely have the biggest effect on the (7V-continuum 
slope (5 and setting up trends with luminosity and possibly redshift 
(§5.2). 



(Finlator 2011, private communication). The mean slope 
to this relationship df3/dMuv of ^ —0.10 is in excellent 
agreement with what is observed (Figure [7] and Table HJ. 
To better illustrate this, we include a comparison of the 
C/y-continuum slopes /3 observed with that predicted 
from the Finlator et al. (2011) simulations (Figure [T^ . 

Rest-frame optical studies of z 4-6 galaxies with 
Spitzer IRAC have provided evidence for a similar se- 
quence at z > 4 for star-forming galaxies (Stark et al. 
2009; Labbe et al. 2010b; Gonzalez et al. 2011). Typi- 
cal star-forming galaxies at z ^ 5 have rest-frame UV- 
optical colors of ~0.6 mag, with a scatter of 0.5 dex 
(Gonzalez et al. 2011, 2012). These f/F-optical colors 
appear to show a slight dependence on luminosity, in the 
sense that brighter galaxies are redder and fainter galax- 
ies are bluer (Gonzalez et al. 2012). So far there is no 
evidence for evolution in the UV-optical colors over the 
redshift range 4 < z < 6 (Stark et al. 2009; Labbe et al. 
2010b; Gonzalez et al. 2011). Both trends in the UV- 
optical colors parallel those found in the f/l^-continuum 
slopes. Together these findings suggest that the evolu- 
tion of galaxies at high-redshift may be self-similar (see 
also Gonzalez et al. 2012). 
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Fig. 14. — Estimated dust extinction {Lju/Li/y + i) versus UV 
luminosity at redshifts 2 ~ 4 (blue), 2 ~ 5 (green), z ~ 6 (black), 
and 2 ~ 7 (red: see §5.3). The dust extinction is estimated based 
on the measured (71^-continuum slopes (Tableland Figure|6)l and 
adopting the Meurer et al. (1999) IRX-/3 relationship. The solid 
circles and lines correspond to the results using the biweight mean 
C/V-continuum slopes and linear fits to the biweight means (Fig- 
ure [6]l. A smaller but similar dust extinction would be derived 
based on the 2 ~ Overzier et al. (2011) Lyman Break Analogue 
sample. The uncertainties we estimate for the dust extinction al- 
most exclusively derive from the assumed systematic errors in /3, 
i.e., A/9 ~ 0.10-0.28. The typical dust extinction inferred for lumi- 
nous galaxies is much larger than it is for lower luminosity galaxies. 
The dependence of the dust extinction on redshift is not as large as 
it is on UV luminosity, but the dust extinction for higher redshift 
galaxies is lower than it is for lower redshift galaxies at the same 
luminosity. 



TABLE 6 

The effective dust extinction (at ~ 1600 A) estimated 

FOR THE LBG population INTEGRATED DOWN TO VARIOUS 
UV LUMINOSITIES (§5.3: SEE ALSO FIGURE [TSll . 



> 0.3L*^3 
Sample (< — 19.8 mag) 



Effective Extinction 
> 0.05L*^3 
(<-17.7 mag) 



> 0.00005L*^3 
(<-10 mag) 



Using Meurer et al. (1999) Relationship' 



a,b 



2.5 
2.5 
1.8 

1 

4.8 



+0.1+0.4 
0.1-0.4 
+0.3+0.4 
0.2-0.4 
+0.2+0.4 
0.2-0.3 
+0.3+0.6 
-0.2-0.4 

+ 1.0 + 1.7 
0.8-1.3 



2.1 
1.9 
1.4 

1.3 

3.5 



+0.1+0.3 
0.1-0.3 
+0.2+0.3 
0.2-0.3 
+0.2+0.2 
0.2-0.2 
+0.2+0.4 
-0.1-0.3 



7+0.1+0.2 
-0.1-0.2 
^ ^+0.1+0.2 



1.7 
.5 
1.2 

1.1^ 



0.1-0.2 
+0.1+0.1 
0.1-0.1 



+0.7 + 1.3 
0.6-0.9 



2.4 



+0.4+0.9 
0.3-0.6 



^ The effective dust extinctions given here are the multiplica- 
tive factors, i.e., Lm/Ljjv + Ij needed to correct the observed 
UV luminosity densities at ~1600A to their intrinsic values, 
after integrating to specific limiting luminosities (specified at 
the top of each column). These extinctions are estimated us- 
ing Meurer et al. (1999) IRX-/3 relationship and integrating 
over the distribution of f/V-continuum slopes observed (Ta- 
ble |4]l. Notice that the dust extinctions arc much lower when 
integrated to very low luminosities (see also Reddy & Steidel 
2009; Bouwens et al. 2009). 

Both random and systematic errors are quoted (presented 
first and second, respectively). 
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Fig. 15. — Dust extinction (Liji/Luy + l) versus redshift (§5.3). 
These factors are calculated from the f/V-continuum slope /3 distri- 
bution using two different IRX-/3 relationships: the Meurer et al. 
(1999) relationship (red lines) and the one derived from Lyman- 
IBreak Analogues (Overzier et al. 2011: black line). The total cor- 
rection factors for the luminosity density are integrated down to 
three different limits for the Meurer et al. (1999) IRX-/3 relationship 
corresponding to 0.3 L*^—rj (the approximate limiting luminosity for 
searches over the GOODS fields), 0.05 i*^3 (the approximate lim- 
iting luminosity for searches over the HUDF09 data), and 0.00005 
L*_3 (the approximate lowest luminosity we might expect galax- 
ies to form: e.g.. Read et al. 2006; Dijkstra et al. 2004); these 
are shown with the dashed, solid, and dotted lines, respectively. 
The corrections are weighted according to luminosity based on the 
2 ~ 4, 2 ~ 5, 2 ~ 6, and 2 ~ 7 LFs from Bouwens et al. (2007) 
and Bouwens et al. (2011b). The dust correction factors are also 
presented in Table [6] This figure is similar to Figure 4 of Overzier 
et al. (2011) but updated to include our latest results on the UV- 
continuum slope /3 distribution at 2 > 4. Our estimated uncer- 
tainties on the dust extinction almost exclusively derive from the 
assumed systematic errors on /3, i.e., A/3 ~ 0.10-0.28. The correc- 
tion factors we find show a slight dependence on redshift, but not 
nearly as strong as what Bouwens et al. (2009) inferred using the 
ACS+NICMOS observations (Figure 8 of Bouwens et al. 2009). 

5.2. Sequence in SF Galaxies: Interpreting (3 vs. 
luminosity trends 

In the previous section, we briefly discussed the weh- 
defined sequence in /?'s we observed versus luminosity 
in our z ~ 4-7 samples as another instance of a "star- 
forming" sequence for galaxies. How shall we interpret 
the changes we observe in the mean UV-contimiwni slope 
/3 of galaxies on this sequence versus their UV luminos- 

Given the approximate correlation of galaxy mass with 
UV luminosity (e.g., Ouchi et al. 2004b; Lee et al. 2006, 
2009; Stark et al. 2009) and gradual build-up of galaxies 
in mass, we would expect the mean properties of galaxies 
to change gradually as a function of their UV luminosity. 
While we can imagine many properties of galaxies driving 
changes in the f/V^- continuum slope /3 as a function of 
UV luminosity, i.e., dust, age, metallicity, AGN content, 
changes in the dust content of galaxies would likely have 
the largest effect. Figure [13] provides a simple illustration 
of this. 0.3 dex changes in the dust content have a much 
larger effect on the ?71^-continuum slope /? than similarly- 
sized changes in the age, metallicity, or the stellar IMF. 

Moreover, given the mass-metallicity relationship ob- 
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served at z ^ 0-4 (e.g., Tremonti et al. 2004; Erb 
et al. 2006a; Maiolino et al. 2008), we would expect 
galaxy metallicity - and hence dust content - to show 
a strong correlation with mass (and luminosity) . Higher 
luminosity galaxies would largely be redder because of 
their greater dust content while lower luminosity galax- 
ies would be bluer due to a scarcity of dust. Such a 
correlation of dust content with mass has been explicitly 
shown (e.g.. Figure 18 from Reddy et al. 2010 vand Fig- 
ure 5 from Pannella et al. 2009). We would, of course, 
also expect changes in the metallicity and age of star- 
forming galaxies to contribute to the observed trends in 
/3; however their effect on the C/l^-continuum slope would 
likely be much smaller in general. See also Bouwens et 
al. (2009: §4.4) and Labbe et al. (2007). 

5.3. Sequence in SF Galaxies: Dust Extinction 

Based upon the above tests and discussion, we will as- 
sume that dust extinction is the dominant variable in 
setting up these observed trends. We therefore use our 
results on the [/F-continuum slope /? distribution to es- 
timate a mean dust extinction for high-redshift galax- 
ies. We will make use of well-known IRX-/3 relationships 
known to work well at z ~ (e.g., Meurer et al. 1999; 
Burgarella et al. 2009; Overzier et al. 2011) and z ^ 2 
(e.g., Reddy & Steidel 2004, Reddy et al. 2006a, 2010, 
2012a; Daddi et al. 2007). The canonical z = IRX-/3 
relationship (Meurer et al. 1999) is 

Aieoo = 4.43 + 1.99/3. (2) 

where Aiqqq is the dust extinction at I6OOA. The Meurer 
et al. (1999) approach is functionally equivalent to cor- 
recting for dust extinction based upon the Calzetti et al. 
(2000) dust curve. 

Of course, use of the Meurer et al. (1999) IRX-/3 re- 
lation at 2; > 3 has not been without controversy, and 
there has been suggestions that the dust extinction in 
high redshift galaxies may be either higher or lower than 
that implied by the Meurer et al. (1999) relationship. 
Certainly we might expect some change given that it is 
likely that AGB stars - thought to be the principal sites 
for the formation of dust - will not be present in the uni- 
verse until the universe is at least 1 Gyr old, and therefore 
the dust that existed in the first 1 Gyr of the universe 
must have formed in another way, e.g., in the winds of 
supernovae (e.g., Maiolino 2006; Maiolino et al. 2008). 
The attenuation curve for dust from AGB stars may be 
very different from dust of other origin (e.g., from SNe). 

Dust obscuration would be higher in the high redshift 
universe if the attention curve were flatter than Calzetti 
et al. (2000) while the obscuration would be lower if 
the attenuation curve were steeper than Calzetti et al. 
(2000), i.e., much more like that from the SMC. A flat- 
ter (steeper) attenuation curve implies a higher (lower) 
dust extinction for a given UV slope. Arguments for its 
being flatter come from efforts to derive the dust proper- 
ties of QSOs (Gallerani et al. 2010) while arguments for 
its being steeper follow from studies of very young galax- 
ies at z ^ 2-3. Both Reddy et al. (2006b) and Siana et 
al. (2008, 2009) find that dust corrections implied by the 
C/T^-continuum slopes /3 of young galaxies are much too 
large for the Meurer et al. (1999) IRX-/3 relation to ap- 
ply. Chary & Pope (2011) argue for low dust extinction 



in high-redshift galaxies based upon extragalactic back- 
ground light stacking results. It has also been argued 
that the Carilli et al. (2008) stacking results of the radio 
emission in z ~ 3 galaxies from COSMOS also suggest 
lower values for the dust extinction, but Reddy et al. 
(2012a) dispute this, arguing that the Carilli et al. (2008) 
results are consistent with previous results (which sup- 
port the Meurer et al. 1999 IRX-/3 relationship in the 
mean) . 

The above arguments aside, there is circumstantial 
evidence that dust obscuration in high-redshift galax- 
ies is likely at least as large as implied by the Meurer 
et al. (1999) relationship (see also discussion in §6.3). 
Perhaps the strongest piece of evidence for substantial 
dust extinction is provided by the large number of high 
mass (^1-3x10^° -^0) galaxies found at z ~ 5-6 in the 
GOODS fields (Eyles et al. 2005; Yan et al. 2005, 2006)0 
Building up these stellar masses by z ~ 6 requires SFRs 
of ^20-30 Mq I yr, even assuming constant SFRs for 1 
Gyr. Without dust extinction, the progenitors to these 
massive galaxies would need to be very luminous indeed, 
i.e., —22 AB mag (Yan et al. 2006). However, such galax- 
ies are not observed at z > 7 in the requisite numbers 
(e.g., Bouwens et al. 2011b), arguing that the progeni- 
tors must be moderately dust obscured or have existed 
in a smaller form (i.e., having subsequently merged). 

One other possible uncertainty in the dust corrections 
regard the possible impact of scatter in the IRX-/3 rela- 
tionship. Smit et al. (2012) found that scatter in this 
relationship could potentially have a modest effect, i.e., 
~0.09 dex, on the estimated dust extinction, but it de- 
pended in detail on the precise cross correlations between 
dust extinction, /3, UV luminosity, and the SFR. Since 
the effect of scatter in the IRX-/3 relation on the mean 
dust extinction is not at all clear (and plausibly consis- 
tent with no net change), we will not consider a correc- 
tion at this time. 

Utilizing Eq. [2] and the observed distribution of UV- 
continuum slopes (Table |4]), we can estimate the mean 
extinction corrections (L/ij/Lfjy-l-l) as a function of W 
luminosity. We have plotted the results in Figure [U for 
the four different redshift intervals considered here. As in 
other work (Bouwens et al. 2009; Reddy & Steidel 2009; 
Sawicki 2012), we find that the typical dust extinction 
in galaxies increases systematically as a function of the 
UV luminosity. Note that in calculating these mean ex- 
tinction factors we integrate over the full f/V^- continuum 
slope distribution. This distribution is approximated as 
a normal distribution with the biweight means given in 
Table |4]) and Icr scatter of 0.34 (the median scatter pre- 
sented in Table HI). We take Aigoo — when Aigoo < 
in Eq. [5] above. 

To determine the actual extinction corrections that are 
appropriate for real samples, we must weight these ex- 
tinction corrections according to the UV LFs determined 
at z ~ 4, z ~ 5, z ~ 6, and z ^ 7 integrated to specific 
limiting luminosities. We will make use of the z ~ 4, 
z ^ 5, z 6, and z ~ 7 LFs of Bouwens et al. (2007) and 
Bouwens et al. (2011b). The results are presented in Ta- 

Nonetheless, it is worth noting that early estimates of the 
stellar masses for 2 ~ 6 galaxies may have been somewhat too high 
as a result of their neglecting to account for the effect of rest-frame 
optical emission lines on the observed IRAC fluxes. 
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ble [S] and Figure [121 We also include the dust extinction 
we would derive using an alternate IRX-/3 relationship 
A1600 = 4.01 + 1.81/3 derived by Overzier et al. (2011). 
Overzier et al. (2011) derive this relationship based upon 
a small sample of 2; '~ galaxies with similar properties 
to z ^ 2-3 Lyman-Break galaxies. 

The extinction corrections given in Table [S] and Fig- 
ure [15] are an update to the estimates we previously pro- 
vided in Bouwens et al. (2009) based upon the best UV- 
continuum slope /3 estimates available at that time. Rel- 
ative to the Bouwens et al. (2009) extinction estimates, 
the most significant change is in the extinction estimates 
we find at 2; ^ 4 which are now ~1.5-2x lower (see also 
Castellano et al. 2012). This is the direct result of the 
somewhat bluer C/l^-continuum slopes /? we find using 
the present ACS-I-WFC3/IR photometry. Such photom- 
etry extends over a wider wavelength baseline than is 
available using ACS alone (as employed by Bouwens et 
al. 2009) and therefore allows for much more accurate 
estimates of (3. 

5.4. Sequence in SF Galaxies: Interpreting j3 vs. 
redshift trends 

The above discussion highlights several clear trends 
that are present in the properties of star-forming galax- 
ies at z ~ 4-7 as a function of their UV luminosity and 
presumably as a function of their stellar mass. The fo- 
cus of this discussion was the ?71^-continuum slopes /3 of 
galaxies - and by inference - their overall dust extinction. 

However, we also noted that the observed /3's in the 
apparent SF sequence showed a modest dependence on 
the redshift of the sources. At face value, this suggests 
that the dust extinction in galaxies must increase, from 
high redshift to low redshift (as per the discussion in §5.2- 
§5.3). We might expect such an evolution based on the 
gradual build-up of both metals and mass in star-forming 
galaxies with cosmic time, as seen in the evolution of 
the mass-metallicity relationship (e.g., Tremonti et al. 
2004; Erb et al. 2006a; Maiolino et al. 2008; Mannucci et 
al. 2009; Laskar et al. 2011), or based on the evolution 
in the observed correlation between dust extinction and 
bolometric luminosity from z ^ 2 to z ~ (e.g., Reddy 
et al. 2006b, 2010; Buat et al. 2007). Such a change 
in the mean dust content of galaxies is also supported 
by the simulations of Finlator et al. (2011) who predict 
essentially the same evolution in mean j3 with redshift 
(Figure [7]), and almost all of the change in (3 (>75%) 
comes from a change in the dust content (Finlator 2011, 
private communication). 

Of course, it is always possible that the observed evo- 
lution in the C/l^-continuum slopes /3 might be due to 
a change in the overall dust composition or extinction 
curve with cosmic time, as might occur if the dust com- 
position depended on the age of the stellar population in 
a galaxy. Indeed, we might expect some change in the 
dust composition of galaxies as a result of the fact that 
dust from SNe would be expected to form much earlier in 
the lifetime of a galaxy than dust from AGB stars (e.g., 
Maiolino et al. 2004; Maiolino 2006; Reddy et al. 2006b; 
Gallerani et al. 2010; Finkelstein et al. 2012). While it 
seems clear that such changes may affect the colors of 
very yoimg star- forming galaxies, it is not clear how im- 
portant they are in driving the trends we observe with 
cosmic time. After all, even without considering such 
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Fig. 16. — (upper) The relationship between the star formation 
rate and stellar mass of galaxies at 2: ~ 4 (§5.5). The points cor- 
respond to the mean SFRs and median stellar masses for galax- 
ies with absolute UV magnitudes ranging from —21.0 mag and 
— 18.5 mag, in 0.5 mag intervals. The stellar masses are from 
Gonzalez et al. (2011) and the SFRs are derived from UV lumi- 
nosities adopting the dust corrections tabulated in Table [6] The 
error bars shown are random errors. The dotted error bar in- 
cluded to the lower right on the plot is representative of the typ- 
ical systematic error that likely applies to the SFR and stellar 
mass estimates. The blue line is a linear fit to the points, with 
SFR ~ (13l5M0/yr)(M./lo9MQ)O-''3±O.32^ rpj^g relationship be- 
tween SFR and stellar mass shown here is expected to be rep- 
resentative for galaxies that are luminosity selected. The stellar 
masses may be up to a factor of 2 higher using M/L ratios from a 
mass-based selection (but the exact correction depends upon un- 
certain details of the SF histories: see discussion in §5.5). (mid- 
dle) The relationship between the specific star formation rate and 
the stellar mass. The best-fit SFR-stellar mass relationship from 
the top panel is presented here in terms of the SSFR (black line: 
SSFR ~ (13l5Gyr-i)(M*/lo9M0)-O-27±o.32)^ rpj^^ ^^^^j. ^^^.^ 

are as in the top panel. The SSFRs may be up to a factor of 2 
lower using M/L ratios from a mass- based selection (see discussion 
in §5.5). (lower) Extinction correction (Lj n/ Ljjv + 1) we apply 
as a function of stellar mass. The extinction correction is based on 
the (7V-continuum slope /3 distribution observed and the Meurer 
et al. (1999) IRX-/3 relationship. The solid line is derived using the 
best-fit (5 and M/L ratio versus luminosity relationship at z ~ 4 
(Gonzalez et al. 2011). The error bars are as in the top panel. 
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Fig. 17.— Specific Star Formation Rate (SFR / A/*) for lu- 
minous, intermediate mass (~ 5 X 10® A^q) galaxies versus red- 
shift {solid blue circles: §5.6). The star formation rates are esti- 
mated using the canonical UV luminosity-to-SFR conversion fac- 
tors (Madau et al. 1 998; Kennicutt 1998) and the extinction factors 
given in Figure [161 Previous estimates of the specific star forma- 
tion rate from Noeske et al. (2007: open black squares), from Daddi 
et al. (2009: solid black squares), from Stark et al. (2009: open 
black circles), and Gonzalez et al. (2010: solid black circle) are 
also shown. The gray shaded regions provide an illustration of the 
rough systemic uncertainties in the SSFRs quoted in the literature. 
The red arrows show the changes in the SSFRs we would derive 
including the current estimates of the dust extinction at 2 ~ 4-7. 
Earlier measurements of the (/V-continuum slopes /3 at 2 ~ 5-7 
(Bouwens et al. 2009; Stanway et al. 2005) were sufficiently blue 
that no dust corrections were previously applied in estimating the 
SSFRs at high redshift. The SSFRs at z ~ 4-7 may be up to 
a factor of 2 lower using M/L ratios from a mass-based selection 
(but the exact correction depends upon uncertain details of the SF 
histories: see also discussion in §5.5 and Appendix B of Reddy et 
al. 2012b). The SSFR seems to evolve much more gradually with 
cosmic time in the observations than in many theoretical models, 
e.g., Neinstein & Dekel (2008: dotted line) and Dave (2008: dashed 
line), but see however Krumholz & Dekel (2011). The approximate 
SSFR-scaling in typical models scales as the specific accretion rate 
of gas (proportional to (1 -|- z)-^-^: Neinstein & Dekel 2008). 

changes in dust composition, the detailed cosmological 
hydrodynamical simulations of Finlator et al. (2011) are 
successful in reproducing the approximate evolution in /? 
we observe. 

5.5. Sequence in SF Galaxies: SFR - Stellar Mass 
Relationship 

In the previous sections, we saw that the overall shape 
of the SED for star-forming galaxies - both in the UV 
continuum slope and the UV-optical colors (Gonzalez et 
al. 2012) - exhibits a very similar dependence on lumi- 
nosity at all redshifts that we examined z ~ 4, 5, 6, and 7 
- suggesting there is a standard star-formation sequence 
for galaxies at high redshift. 

In this section, we derive an approximate relationship 
between the SFR and stellar mass of galaxies that re- 
side on the "star-forming sequence." To do this, we uti- 
lize the observed UV luminosities, ?7F-continuum slopes, 
and M/L ratios inferred from the observations. We 
transform the UV luminosities into SFRs by using the 
canonical Kennicutt (1998) and Madau et al. (1998) UV 
luminosity-to-SFR conversion factor. A dust correction 
is made at different UV luminosities using the Meurer 
et al. (1999) IRX-/3 relationship and the f/l^-continuum 
slope P distribution determined in this paper. As we have 
seen, these dust corrections are quite significant (~3x) at 
higher luminosities around i*^3, but are essentially zero 



at lower luminosities (< 0.1i*^3: > —18.5 AB mag). 

Stellar masses can be calculated using the luminosity- 
dependent M/L ratios derived by Gonzalez et al. (2011). 
Gonzalez et al. (2011) derived these M/L ratios utiliz- 
ing the HST optical + HST near-IR -I- Spitzer IRAC 
photometry for a large sample of z ^ 4 galaxies within 
the CDF-South ERS field (including those sources that 
are not individually detected in the IRAC observations) . 
Gonzalez et al. (2011) find that the M/L ratios scale as 
(M/Luv) 

(x L^y. Gonzalez et al. (2011) find a steeper 
dependence on luminosity than found by Stark et al. 
(2009), where the M/L ratio scale as (M/Luv) oc Lfj\J^. 
The z ~ 4 results should be fairly indicative of the re- 
sults at higher redshift given the lack of clear evolution 
in the M/L ratio from z ^ 7 to z ^ 4 for galaxies at a 
fixed UV luminosity {Mjjv ^ —22 to —18 mag: Stark et 
al. 2009; Labbe et al. 2010a,b; Gonzalez et al. 2011). 

The resulting SFRs and stellar masses for galax- 
ies in various UV luminosity bins are shown in the 
top panel of Figure [TH Fitting a line to these 
points in log-log space, we find that the SFR varies 
as (13t^M0yr-i)(M,/lO9M0)*'-'^3±o.32^ Without any 

dust correction, we find that the SFR varies as 
(6t^M0yr-i)(M*/lO9M0)O-59±O-32. The equivalent re- 
sults for the SSFR and dust extinction are shown in the 
lower two panels of Figure \W\ (see also §5.6). 

Interestingly the SFR vs. stellar mass relationship we 
derive including the effects of dust extinction is much 
more linear than we would derive without it. Without 
any dust correction, the results of Stark et al. (2009) 
and Gonzalez et al. (2011) imply that SFR oc M°-^^ and 
SFR oc M°-^^, respectively. However, correcting for dust 
extinction, these relationships become a much more lin- 
ear SFR oc M^-°^ and SFR oc M" '^^, respectively. An 
approximately linear proportionality, i.e., SFR vs. M*, 
is exactly what is expected from cosmological hydrody- 
namical simulations (e.g., Dave et al. 2006; Finlator et 
al. 2011; Dayal & Ferrara 2012). Indeed, it points to a 
scenario where galaxies build up exponentially with time 
along a well-defined star- forming sequence (e.g.. Stark et 
al. 2009; Papovich et al. 2011; Lee et al. 2011). 

We emphasize that the SFRs and stellar masses shown 
in Figure [16] (and also quoted for the "star-forming 
sequence" ) are representative values for a luminosity- 
selected sample. They were derived using the median 
M/L ratios found in specific bins of UV luminosity 
(Gonzalez et al. 2011). In general, one would expect 
different results for the M/L ratios using mass rather 
than luminosity-selected samples. Different results are 
also expected using mean rather than median M/L ra- 
tios. In particular, a mass selection would yield higher 
values for the M/L ratios (though the size of the effect 
will depend substantially on the scatter in the M/L ra- 
tios). Also use of mean rather than median M/L ratios 
for these calculations would increase the quoted masses. 
The reason is that medians will not account for the large 
amounts of mass in the tail of the distribution that ex- 
tends to high masses. Both effects work in the same sense 
and would tend to increase the masses of galaxies at each 
point on the sequence. Overall, by correcting for these 
effects, we would expect somewhat higher stellar masses 
for galaxies on the sequence and somewhat lower values 
of the SSFRs. The precise corrections depend, of course. 
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upon the duty cycle for star formation and scatter in 
the M/L ratios, but factor of ~2 corrections would not 
be surprising. Reddy et al. (2012b) include an extended 
discussion of the effect of source selection on observed 
SFR vs. stellar mass relations in their Appendix B and 
Figure 26. 



5.6. Sequence in SF Galaxies: Evolution in the SSFR 

The results of the previous section allow us to update 
previous estimates of the specific star formation rate at 
high rcdshift to include a correction for the dust extinc- 
tion. The specific star formation rate - the star formation 
rate divided by the stellar mass - has been of consider- 
able interest recently due to the evidence that the specific 
star formation rate may not evolve very rapidly at high 
redshift (Stark et al. 2009; Gonzalez et al. 2010), in signif- 
icant contrast to that expected from theory (e.g., Bouche 
et al. 2010; Dave 2010; Dutton et al. 2010; Weinmann et 
al. 2011; but see Krumholz & Dekel 2011). 

One shortcoming of these early SSFR determinations 
at high redshift (Stark et al. 2009; Gonzalez et al. 2010) 
was that no dust correction was applied (Gonzalez et al. 
2010). Such an approach seemed appropriate at the time, 
given the very blue [/V^-continuum slopes /? observed for 
galaxies in the redshift range z > 5 (Bouwens et al. 2009) 
and large uncertainties on those J/y-continuum slopes 
(e.g., Bouwens et al. 2009). However now that sufficiently 
deep, wide-area near-IR data are available we can estab- 
lish the J/y-continuum slopes /? and approximate dust 
corrections more accurately. 

Our new estimates of the dust extinction at z ^ 4-7 
allow us to correct previous SSFR estimates at z ~ 4-7. 
The results are presented in Figure [T71 Typical correc- 
tions result in a factor of ~2-3 increase in the SSFR. 

We emphasize that the above corrections to the SSFR 
at z > 4 are very schematic in nature. A proper de- 
termination of the SSFR in this regime requires a fairly 
extensive, self-consistent analysis of the observations. In 
addition to the issue of dust extinction, other issues that 
need to be considered are (1) the selection of the sam- 
ples by mass in contrast to selection by luminosity and 
(2) emission-line contamination of our broadband flux 
measurements (e.g., Schaerer & de Barros 2010). 

Already there are many theoretical predictions about 
how the SSFR should evolve with cosmic time (e.g., 
Bouche et al. 2010; Weinmann et al. 2011; Krumholz 
& Dekel 2011). How well do these expectations agree 
with our corrected SSFRs? The evolution of the SSFR 
in many models follows the specific accretion rate M /Al, 
which scales as (l-|-z)^-^ {dotted line on Figure[T7J Nein- 
stein & Dekel 2008; Weinmann et al. 2011). This implies 
a factor of ~10 decrease in the specific star formation 
rate from z ~ 7 to z ^ 2. By comparison, our revised es- 
timates of the dust extinction at z ^ 4-6 imply a SSFR 
that decreases by a factor of ^3 to z 2. While this 
still does not match the evolution predicted by standard 
models, the agreement is better. Are there additional in- 
gredients that might lead to further changes? One pos- 
siblity that has been discussed includes accounting for 
metallicity dependencies in one's SFR prescription (e.g., 
Krumholz & Dekel 2011) or changes to the stellar IMF 
(Dave 2010; Schaye et al. 2010). 
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Fig. 18. — Scatter in the (JV-continuum slope /3 distribution afj 
resulting from scatter in the dust extinction, metallicity, or in- 
stantaneous SFRs (see §5.7). The typical scatter in the intrinsic /9 
distribution (after subtracting the contribution due to photometric 
uncertainties) is ~0.34, as indicated by the cyan-shaded region (see 
Table[3]| . In estimating the approximate scatter in /3 that would re- 
sult from variations in the dust, metallicity, or star formation rate, 
we adopt a base stellar population model an E{B — V) of ^ 0.15, 
a [Z/Zq] of —0.7, a Salpeter IMF, and a constant star formation 
history. In considering variations in SF history (or instantaneous 
SFRs of galaxies), we break up the star formation history of each 
galaxy in ten 50 Myr-segments, treat each 50 Myr segment in the 
SF history as independent, and randomly choose a SFR for each 
segment from a log-normal distribution. It is evident from this 
figure that one can approximately match the observed scatter in 
the f/V-continuum slope distribution, by allowing for ~0.3 dex 
variations in the dust content or ^0.9 dex in the instantaneous 
SFR. Substantially larger variations in dust content or instanta- 
neous SFRs would introduce a larger scatter in the (/V-continuum 
slope 13 distribution - which is inconsistent with the observations. 

5.7. Sequence in SF Galaxies: Small Scatter in the j3 
Distribution 

The distribution of ?7T^-continuum slopes /3 shows a 
remarkably small intrinsic scatter ap for a fixed UV lu- 
minosity. The typical scatter ct/j observed is just ~0.34 
(Table |4|) . A similar scatter was found by Labbe et al. 
(2007) and Bouwens et al. (2009) for galaxy samples at 
z ^ 1-2.7 and z ~ 4, respectively. Such a small scatter 
allows us to set upper limits on variations in the star 
formation histories and dust content of galaxies (assum- 
ing that changes in one variable do not offset changes in 
other variables). 

Figure [18] illustrates the impact that scatter in dust, 
age, or metallicity would have on scatter in the /3 distri- 
bution. For the typical galaxy, we assume an E{B — V) of 
-0.15, a metalhcity [Z/Zq] of -0.7, a Salpeter IMF, and 
an approximately constant star formation history with 
some scatter in the star formation rate. These parame- 
ters are fairly representative for what has been found in 
stellar population models of luminous z ~ 2-4 galaxies 
(e.g., Papovich et al. 2001; Shapley et al. 2001, 2005; 
Erb et al. 2006b; Reddy et al. 2006b). 

We begin by using the observed scatter in the UV- 
continuum slope (3 distribution to set constraints on scat- 
ter in the ages (or star formation history) of star-forming 
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TABLE 7 

UV Luminosity Densities and Star Formation Rate Densities to —17.7 AB mag (0.05 L*^. 

SEE §6.1-§6.2).=' 



logj^Q£ logio SFR density 

Dropout (ergs s^^ i^Q Mpc~'^ yr""*^) 

Sample < z> Hz^i Mpc^^-j Uncorrected Corrected IncL ULIRG'^ 



B 3.8 26.38±0.05 -1.52±0.05 -1.21 ±0.05 -1.12 ±0.05 

V 5.0 26.08±0.06 -1.82±0.06 -1.54 ±0.06 -1.51 ±0.06 
i 5.9 26.02±0.08 -1.88±0.08 -1.72 ± 0.08 -1.71 ±0.08 
z 6.8 25.88±0.10 -2.02±0.10 -1.90 ±0.10 -1.90 ±0.10 

Y 8.0 25.65±0.11 -2.25±0.11 -2.13 ±0.11 -2.13 ±0.11 
J<1 10.3 24.ll!5;^ -3-8lo.| -3-8lo:7 -3-8^^7 
J"^ 10.3 <24.2<= < -3.7== < -3.7<= < -3.7'= 



^ Integrated down to 0.05 L*_^. Based upon LP parameters in Table 2 of Bouwens et al. (2011b: see 
also Bouwens et al. 2007) (see §6.1). The SFR density estimates assume > 100 Myr constant SFR 
and a Salpeter IMF (e.g., Madau et al. 1998). Conversion to a Chabrier (2003) IMF would result in 
a factor of ~1.8 (0.25 dex) decrease in the SFR density estimates given here. 
See §6.2 

Upper limits here are Icr (68% confidence). 

2 ~ 10 determinations and limits are from Oesch et al. (2012a: see also Bouwens et al. 2011a) and 
assume 0.8 z ^ 10 candidates in the first case and no z ~ 10 candidates (i.e., an upper limit) in the 
second case. 



t (Gyr) 




z 

Fig. 19. — Star formation rate density as a function of redshift (§6.1-6.2: see also TableEl. The lower set of blue points and blue-shaded 
region give the UV luminosity density {right axis) and hence the SFR density before correction for dust extinction. The upper set of 
red points and red-shaded region give the SFR density {left axis), after applying our estimated dust corrections (see Table IBJ. The UV 
luminosity density determinations are integrated down to —17.7 AB mag (0.05 i*— 3) and taken from Schiminovich et al. (2005: open black 
squares) at 2 < 1, Oesch et al. (2010c: solid black squares; see also Hathi et al. 2010) at 2 ~ 1-2.5, Reddy et al. (2009: green crosses) 
at 2 ~ 2-3, Bouwens et al. (2007, 2011b) at 2 > 4, and Oesch et al. (2012a) at 2 ~ 10. The dark red shaded region and dark dashed 
line include the contribution from IR bright sources (Magnelli et al. 2009, 2011; Daddi et al. 2009). The contribution from the IR bright 
population needs to be explicitly included in the SFR density estimates - since dust corrections do not typically recover the total SFRs for 
especially luminous > 10^^ Lq galaxies (e.g., Reddy et al. 2008). The IR bright population contributes very little to the SFR density at 
2 > 4 (e.g., Bouwens et al. 2009). Conversion to a Chabrier (2003) IMF would result in a factor of ~1.8 (0.25 dex) decrease in the SFR 
density estimates given here. 
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galaxies at high redshift. One potentiahy promising ap- 
proach is to consider star-formation history for galaxies 
with stochastic variations in the SFR over 50 Myr inter- 
vals (a typical time scale over which one might imagine 
the SFR in a galaxy might be correlated). Assuming 
similar variations in the SFRs of 2: > 4 galaxies to that 
observed at 2 ~ 0-1 (where a scatter of ~0.3 dex in ob- 
served the SFR-stellar mass relationship: Noeske et al. 
2007) results in just a 0.14 scatter in /3. We computed 
this scatter by (1) running a simulation with 1000 in- 
put galaxies, (2) dividing up the star formation history 
for each galaxy into ten 50-Myr segments, (3) randomly 
selecting a star formation rate for each 50-Myr segment 
from a log-normal distribution with 0.3 dex scatter, (4) 
computing the resultant /3's for each galaxy based on its 
star formation history, and (5) calculating the scatter in 
the derived (3 distribution for the simulated galaxies. For 
simplicity, each galaxy in the simulation is taken to have 
an age of 500 Myr. 

The predicted scatter in /3, i.e., 0.14, is considerably 
less than we observe. We can of course increase the pre- 
dicted scatter in l3 by considering star formation histories 
with larger variations in the SFRs. For example, a 0.9 
dex scatter in the SFRs translates into a 0.34 dex scatter 
in /3 - which is a good match to intrinsic scatter in /? 
{(Tp). This is similar to the scatter found by Gonzalez 
et al. (2011) in modelling the distribution of M/L ratios 
for z ~ 4 star-forming galaxies. In the above modeling, 
no account is made for changes to the total luminosity of 
galaxies, as a result of a stochastic star formation history. 

Scatter in the dust content can add significantly to the 
scatter in the ?7T^-continuum slope /3, but the magni- 
tude of the scatter will depend directly on how dusty 
galaxies are in the luminosity range one is considering. 
If the dust extinction is low, for example, dust has very 
little impact on the /3 observed, and therefore small mul- 
tiplicative changes to the dust extinction factor would 
have similarly little impact. On the other hand, if the 
dust extinction is non-negligible, multiplicative changes 
to the total dust extinction factor would have a big im- 
pact on the value of /3 one observes. For the fiducial 
luminous galaxy dX z ^ 2.5, with an E{B — V) ^ 0.15 
and a Calzetti et al. (2000) dust law, a ~0.3 dex scatter 
in the dust extinction would result in an observed scatter 
ap of ~0.34 (Figure [Hj). 

Scatter in the metallicity adds very little to scatter in 
the f/F- continuum slopes (Figure [T8|) . and therefore one 
cannot use the observed scatter in the C/y-continuum 
slope (3 distribution to set strong limits on scatter in 
galaxy metallicity. 

In summary, the observed scatter in the C/y-continuum 
slopes (3 distribution allows us to set upper limits on vari- 
ations in dust content and instantaneous SFR of galax- 
ies (assuming that changes in one variable do not offset 
changes in other variables). Scatter in the dust extinc- 
tion of galaxies appear to be <0.3 dex and scatter in the 
instantaneous SFR is ^0.9 dex. 

6. STAR FORMATION RATE DENSITY AT HIGH 
REDSHIFT 

The availability of WFC3 /IR data over both the ultra- 
deep IIUDF09 fields and wide-area fields has allowed us 
to establish the C/y-continuum slopes to great accuracy 
over a wide range in redshift and luminosity. In the pre- 



vious section, we used these C/T^-continuum slope dis- 
tributions to estimate the mean dust extinction in star- 
forming galaxies at high redshift. 

Here we utilize these new estimates of the dust ex- 
tinction to revisit our determinations of the SFR density 
at high redshift. In §6.1, we begin by first determin- 
ing the SFR density from those galaxies that make up 
our high-redshift LEG selections. In §6.2, we include 
the contribution from ultra- luminous (I/f,oi > lO"*^^ Lq) 
IR-bright galaxies (§6.2). We include this contribution 
explicitly since dust corrections tend to underestimate 
the SFRs for the the most luminous, IR-bright galaxes 
(e.g., Reddy et al. 2009) and since such galaxies are not 
typically well-represented in rest-frame UV, LBG-type 
selections. Finally, in §6.3, we compare our total SFR 
density estimates with what we would infer from current 
measures of the stellar mass density. 

6.1. SFR Density at High Redshift 

In this subsection, we determine the SFR density using 
our current estimates of the dust extinction. As in pre- 
vious work (Bouwens et al. 2009; Bouwens et al. 2011b), 
we base our SFR density determinations on our most 
recent LF determinations at z ~ 4-8 (Bouwens et al. 
2007; Bouwens et al. 2011b) and search results at z ~ 10 
(Bouwens et al. 2011a; Oesch et al. 2012a). Luminosity 
densities are derived by integrating these LFs down to 
-17.7 AB mag (0.05 i*=3) which is the limit to which 
we probe the LFs at both z ~ 7 and z ~ 8. These lu- 
minosity densities are then converted into SFR densities 
using the canonical Madau et al. (1998) and Kennicutt 
et al. (1998) relation: 

Luv = f TT^l X lO^ergs s"! Hz-^ (3) 

where a 0.1-125 Salpeter IMF and a constant star 
formation rate of > 100 Myr are assumed. Finally, for 
our dust extinction estimates, we will use those from Ta- 
ble H calculated using the Meurer et al. (1999) IRX-/3 
relationship. 

Our latest UV luminosity density and SFR density es- 
timates are summarized in Table [7] and presented in Fig- 
ure [191 The new estimates are in broad agreement with 
previous estimates (Bouwens et al. 2009), but we find a 
lower SFR density at z ~ 4, as expected, given the lower 
dust extinction we infer at these redshifts. The change is 
significant, with the SFR density decreasing by a factor 
of ^1.5-2 at this redshift (see also Castellano et al. 2012). 

6.2. SFR Density Estimates including IR Luminous 
galaxies 

Dusty, infrared-luminous galaxies also potentially con- 
tribute quite meaningfully to the SFR density at high 
redshift. However, it can be quite challenging to account 
for this contribution on the basis of optical/near-IR sur- 
veys with the Hubble Space Telescope. Not only can it 
be difficult to identify such sources in these surveys (due 
to their faintness in the UV or red colors which cause 
them to be excluded from LBG selections), but it is now 
well established that the dust extinction for the most 
dusty, infrared luminous (>10^'^ Lq) galaxies at z ~ 1-3 
cannot be accurately estimated using the observed UV- 
continuum slopes (3 and IRX-/3 relationship (e.g., Reddy 
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Fig. 20. — A comparison of the derived SFR density with that implied by stellar mass density determinations in the literature {hatched 
red region: Labbe et al. 2010b; Gonzalez et al. 2011: §6.3). Published estimates of the stellar mass density at 2 ~ 8 (Labbe et al. 201 0b) 
are very uncertain at present and therefore not used to infer a SFR density at 2; > 8. The SFR density determinations are as in Figure 1191 
and Table[7] Conversion to a Chabrier (2003) IMF would result in a factor of ~1.8 (0.25 dex) decrease in the SFR density estimates given 
here. Good agreement is observed between the SFR. density and that implied by the stellar mass density. The agreement is much better 
than in Bouwens et al. (2011b), as expected given the improvements in the l/V-continuum slope /3 determinations at 2 > 4 (and therefore 
likely dust corrections). Note that we actually require a dust correction to obtain good agreement between the SFR density and that implied 
by the stellar mass density. 



et al. 2006b; Elbaz et al. 2007). 

To accurately account for the SFR density in this pop- 
ulation, a better approach is to include that population 
explicitly by utilizing a luminosity function in the mid- 
IR/far-IR and integrating down to 10^^ Lq (e.g., Reddy 
et al. 2008; Reddy & Steidel 2009; Bouwens et al. 2009). 
We replicate that approach here integrating published 
mid-IR LFs to lO^^L©, converting the total IR luminos- 
ity to SFR using the canonical relation in Kennicutt et 
al. (1998), and then adding the inferred SFR densities to 
the dust-corrected UV SFR densities. The IR luminos- 
ity functions we utilize are Magnelli et al. (2009, 2011) 
to z - 2 and Daddi et al. (2009) at z ~ 4. Magnelh et 
al. (2011) utilize the full set of deep 24^m and 70^m ob- 
servations over the two GOODS fields from the FIDEL 
program. Since the 70/im and 24/im data that Magnelli 
et al. (2011) utilise are significantly deeper than that used 
by Caputi et al. (2007) and allow for a self-consistent cor- 
rection to the total bolometric luminosity, the Magnelli 
et al. (2011) SFR density estimate represents a notewor- 
thy improvement on the Caputi et al. (2007) estimates 
we previously utilized at 2 ~ 2 (Bouwens et al. 2009). 

Including the contribution from IR bright galaxies, we 
present our total SFR density estimates in Table [7] and 
Figure 1191 Interestingly, but not surprising, this cor- 
rection makes very little difference to the SFR density 
derived at very high redshifts z > 4, but adds modestly 
to the SFR density at late cosmic times (z < 3). A small 
contribution of ULIRGs to the SFR density at high red- 



shifts is expected given their position at the very end of 
the extended build-up process whereby galaxies gradu- 
ally acquire higher and higher masses in gas, dust and 
stars (e.g., Bouwens et al. 2009). 

6.3. SFR Densities Implied by Stellar Mass Density 
Measurements 

As in our previous work (Bouwens et al. 2011b), we can 
compare our SFR density estimates with that implied by 
recent stellar mass density determinations (e.g.. Stark 
et al. 2009; Gonzalez et al. 2011; Labbe et al. 2010a,b; 
Gonzalez et al. 2011). In doing so, we consider the inte- 
grated SFR density and stellar mass density to the same 
luminosity limits 6.05 L^^^ for self consistencvF^ 

We use the following formula to infer an approximate 
SFR density (SFRD) at 2; > 4 from the observed stellar 
mass density (SMD): 



SFRD(z„Zj) 



SMD(z,) - SMD(zj) 
time(zi) — time(zj) 



{l~e)-'fLE (4) 



where Zi < Zj are the redshifts of adjacent Lyman-break 
samples and e is the gas recycling factor. The 1 — e 
factor accounts for the recycling of gas mass from high- 
mass stars back into the interstellar medium through 

The importance of using a consistent limit for this comparison 
was made by Reddy & Steidel (2009) in regards to claims that SFR 
density determinations at 2: ~ 2-4 might not be consistent with 
stellar mass determinations in the same redshift range. 
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SNe explosions. Recycling results in only a fraction of 
the stars formed being locked up in stellar mass, i.e., 
dM^/dt = (1 — e)SFR where e = 0.3 appropriate for a 
Salpeter IMF (e.g., Bruzual & Chariot 2003). The /lb 
factor in the above equation accounts for the fact that 
new galaxies enter our magnitude-limited samples at all 
redshifts simply as a result of galaxy growth. Since these 
galaxies (and their stellar mass) would not have been in- 
cluded in the magnitude-limited sample just above them 
in redshift, these sources would cause us to overestimate 
the SFR density (see §7.4 of Bouwens et al. 2011b). Ac- 
counting for this latter effect reduces the inferred SFR 
density by a factor of 1.3; therefore, we take /le to be 
1/1.3. 

The SFR densities implied by several recent stellar 
mass density determinations (e.g.. Stark et al. 2009; 
Gonzalez et al. 2010; Labbe et al. 2010a, b; Gonzalez et al. 
2011) are presented in Figure [20l and there is remarkably 
good agreement over the redshift range z ^ 4-6. This is 
a useful consistency check and suggests that the dust ex- 
tinctions we are inferring at z ~ 4-7 are reasonable and 
fit into a consistent picture. The general agreement we 
observe also points towards no clear evolution in the stel- 
lar initial mass function (IMF) to high redshift {z > 4: 
see also Bouwens et al. 2011b; Papovich et al. 2011) since 
any changes in the IMF would have an effect on the SFR 
density or stellar mass densities we compute from the 
observed light and likely result in a mismatch. 

The observed agreement also suggests that biases in 
our stellar mass density estimates at high redshift, e.g., 
due to the contamination of the rest-frame optical light 
probed by IRAC with strong emission lines (Schaerer & 
de Barros 2010), are not huge. Of course, we cannot to- 
tally rule out modest levels of contamination by emission 
lines, particularly to the SEDs of z ^ 6-7 galaxies, and in 
fact there may be some evidence in the stacked SEDs of 
z ~ 5-7 galaxies that emission lines do have some effect 
on the IRAC fluxes (Gonzalez et al. 2012). 

7. SUMMARY 

The recent availability of ultra-deep WFC3/IR ob- 
servations over the HUDF, CDF-South GOODS, and 
the two IIUDF05 fields has allowed us to measure UV- 
continuum slopes for large samples of star-forming galax- 
ies at z ~ 4-7. Such measurements can be made both for 
samples of z ~ 4-7 galaxies to very faint levels (i.e., —17 
AB mag) and also for much brighter samples (i.e., —21 
AB mag). Use of the Lyman-break selection technique 
allows us to divide these sources by redshift into four dis- 
tinct redshift samples (z ~ 4, z ~ 5, z ~ 6, and z ^ 7) 
and thus to quantify the changes in the t/T^-continuum 
slope /3 with cosmic time from '--^0.7 Gyr to 1.8 Gyr after 
the Big Bang. 

This is the first time the f/V^-continuum slope /3 dis- 
tribution can be derived with such small uncertainties 
for a large sample of z ~ 4-7 galaxies (~2500 galaxies). 
Full use of the flux information in the f/F-continuum is 
made in determining the t/y-continuum slope /3 for indi- 
vidual sources (§3.3), except of course those bands con- 
taminated by emission from Lya or affected by the posi- 
tion of Lyman or Balmer breaks (at ■~1216A or ~3600A, 
respectively). This resulted in > 1.5 x smaller uncertain- 
ties in our measurements of /3 than obtained using other 
techniques (see Appendix B.3). It is essential to keep 



the uncertainties in our /3 measurements to a minimum 
if we are to accurately characterize the scatter in the /3 
distribution. 

Care was taken to minimize the effect of source selec- 
tion and photometric scatter on our results (Appendix 
B). Such effects can significantly bias determinations of 
the t/F-continuum slope distribution (see e.g., Dunlop 
et al. 2012), so it is crucial to utilize techniques that 
minimize the bias. We estimate the bias using exten- 
sive Monte-Carlo simulations where we added artificial 
sources to the observations and then select and measure 
their properties in the same way as the real observations. 
We demonstrate that we can recover the distribution of 
C/y-continuum slopes /? to very faint magnitudes with 
very small biases (^0.1: see Figure [5] and [54)) . The very 
small biases found here appear to be in significant con- 
trast to the large biases found by techniques that use 
similar information both to select sources and measure 
their ?7F-continuum slopes (3 (see Figures [TUl and I^S)) . In 
particular, as we demonstrate through extensive simula- 
tions (Appendix D), a coupling between source selection 
and /3 measurement seems to have produced the large bi- 
ases reported by Dunlop et al. (2012: §4.6) towards mea- 
suring blue slopes (since sources with blue slopes show a 
greater likelihood to be at z > 5). In addition, we find 
that we can select galaxies with J/^-continuum slopes as 
red as 0.5 (Figure [2] and [21]), so the [/^-continuum slope 
distributions we derive should be valid over a wide range 
in [/l/-continuum slope (3 (i.e., —3.5 to 0.5). Small cor- 
rections were made based on the selection biases found 
in our simulations. 

Using the above procedure, we accurately establish the 
distribution of J/y-continuum slopes /3 over a wide range 
in both redshift and luminosity. This is the first time this 
has been possible to do so with such precision, and we 
use these J/y- continuum slope /3 distributions to make 
inferences about how the dust properties of galaxies likely 
vary with both luminosity and redshift. We then use 
these results to derive a SFR versus stellar mass sequence 
for galaxies at z ~ 4, to intepret the evolution of specific 
star formation rate with cosmic time, and to compare the 
SFR density results at z ~ 4-8 with that inferred from 
the stellar mass density. 

Here are our primary findings: 

• Galaxies at high redshift lie along a well-defined 
{ap ~ 0.34) UV-co\oT versus magnitude sequence 
at all redshifts under study (z ^ 4-7). Previously 
Labbe et al. (2007) and Bouwens et al. (2009) pre- 
sented evidence for similar sequences at z ~ 1-3 
and z ~ 2.5-4. 

• The biweight mean 17 continuum slope /3 shows 
an approximately linear relationship with UV lu- 
minosity in all four redshift intervals we examine, 
i.e., z ~ 4, z ~ 5, z ~ 6, and z ~ 7. The mean 
/3 of higher luminosity galaxies is redder than that 
found for lower luminosity galaxies in all four sam- 
ples. We demonstrate that this trend is not an arti- 
fact of source selection in our analysis, contrary to 
the suggestion of Dunlop et al. (2012). See §4.6 and 
Appendix B. Similar trends were found by Labbe 
et al. (2007), Overzier et al. (2008), Bouwens et al. 
(2009), Bouwens et al. (2010a), Finkelstein et al. 
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(2010), Lee et al. (2011), and Wilkins et al. (2011: 
§4.4). 

No statistically-significant evolution in the slope 
of C/T^-continuum slope /3 - luminosity sequence is 
found over the entire redshift range z 4-7 (see 
Figure[7|) . Particularly striking are the slopes of the 
/? - luminosity relationship at z ~ 4, z ~ 5, and z ~ 
6 where the slopes are -O.lliO.Ol, -0.16±0.03, 
and — 0.15±0.04, respectively. The slope of this 
relationship at z ~ 7, i.e., — 0.21±0.07, is consis- 
tent with that at lower redshift (later times). The 
derived slopes to the /3-luminosity relationship are 
in good agreement with previous results at z 3- 
5 (e.g., Bouwens et al. 2009; Overzier et al. 2008; 
Wilkins et al. 2011). 

We observe an evolution in the intercept to the 
f/y-slope /3 vs. luminosity relationship with cosmic 
time, in the sense that higher redshift galaxies of a 
given W luminosity are bluer than lower redshift 
galaxies of the same UV luminosity (see Figure [7]). 
While such an evolution in colors had already been 
found over the redshift range z ^ 5-6 to z ^ 3- 
4 (Lehnert & Bremer 2003; Stanway et al. 2005; 
Bouwens et al. 2006, 2009, 2010a), this confirms 
this result at much higher confidence. We remark 
that this evolution in the colors might have been 
expected based on the evolution seen in the dust 
extinction vs. bolometric luminosity relationship 
from z ^ 2 to z (e.g., Reddy et al. 2006b, 
2010; Buat et al. 2007) and also the evolution seen 
in the mass-metallicity relationship from z ^ 3.5 to 
z ~ (e.g., Tremonti et al. 2004; Erb et al. 2006a; 
Maiolino et al. 2008; Mannucci et al. 2009; Laskar 
et ah 2011: see §5.4). 

We observe similarly red [/^-continuum slopes /3, 
i.e., —1.6 to —1.9, for the most luminous galaxies 
(and presumably most massive) in all four redshift 
samples examined here (similarly red values were 
also found by Lee et al. 2011 and Willott et al. 
2012). The similarity of the observed /3's for the 
most luminous galaxies (and presumably the most 
massive) underscores the potential importance of 
mass in setting the properties of individual galaxies 
(see also Finkelstein et al. 2012). 

We find further evidence that the mean UV- 
continuum slope /3 of faint z ^ 6-7 galaxies is 
very blue, i.e., < —2.5 (§4.8). After correcting 
for the relevant biases (Appendix B), the biweight 
mean C/T^-continuum slope /3 we measure for — 18 
AB mag sources is — 2.5±0.2 and — 2.7±0.2 at z '-^ 6 
and z ^ 7, respectively. While very blue over- 
all (relative to the typical galaxy at low redshift), 
these /3's are not inconsistent with what one can 
achieve with conventional stellar population mod- 
elling and therefore do not require exotic stellar 
populations to explain. Instead, the blue /3's ap- 
pear to be as expected given the observed trends 
in /3 versus both redshift and luminosity. Our mea- 
sured /3's confirm earlier results by Bouwens et 
al. (2010a), Bouwens et al. (2010b), Oesch et al. 



(2010a), Bunker et al. (2010), and Finkelstein et 
al. (2010). 

• We can use the intrinsic scatter in the UV- 
continuum slope /3 distribution (Table S]) to set 
limits on variations in the dust extinction or in- 
stantaneous SFR of galaxies on the "star-forming" 
sequence (§5.7). The inferred scatter of up ~ 0.34 
in the [/^-continuum slope /3 distribution corre- 
sponds to a maximum scatter of 0.3 dex in the dust 
extinction of galaxies and 0.9 dex in the instanta- 
neous SFRs. 

• We argue that changes in C/F-continuum slope /3 
as a function of redshift and luminosity are primar- 
ily driven by changes in the mean dust extinction 
of galaxies (§5.2, §5.4: see e.g. §4.5 of Bouwens 
et al. 2009). Using the current determinations of 
the ?7F-continuum slope distribution at z ~ 4-7 
and the IRX-/3 relationship at z ~ (Meurer et al. 
1999; Overzier et al. 2011), we estimate the approx- 
imate dust extinction of galaxies as a function of 
luminosity at z ~ 4, 5, 6, and 7 (see Figure see 
§5.3). We find that the dust extinction for galaxies 
at lower luminosities and high redshift is essentially 
zero (Figure [Til and §5.3: e.g., see also Bouwens et 
al. 2009). 

• We find good agreement between the SFR density 
inferred from stellar mass density estimates and 
that inferred from the dust-corrected UV observa- 
tions over the redshift range z ~ 4-7 (Figure [20l 
§6.3). The agreement is better than that found by 
Bouwens et al. (2011b) using the Bouwens et al. 
(2009) estimates of the dust correction. 

• We have used our new estimates of the dust extinc- 
tion and mass-to-light estimates from the literature 
(e.g.. Stark et al. 2009; Gonzalez et al. 2011) to 
reexamine the relationship between the SFR and 
stellar mass Af* of galaxies (§5.5). We find that 
the SFR is proportional to M^o '^3±o.32 (Figure [i6l 
see also Labbe et al. 2010a). The exponent to this 
relationship is much closer to 1.0 including a cor- 
rection for dust extinction than not including it. 
This relationship is therefore plausibly close to the 
simple proportionality, i.e., SFR oc Af,, expected 
in many cosmological hydrodynamical simulations 
(e.g., Finlator et al. 2011). 

• Our new estimates of the dust extinction imply 
higher values of the SSFR at z '-^ 4-7 for the in- 
termediate mass (^ 5 X 10^ galaxies where 
this dependence was quantified (§5.6). The implied 
change in the SSFRs at z > 4 is approximately a 
factor of ~2-3 higher than before. With the im- 
plied changes to the SSFR, the SSFR at z 5-7 is 
therefore plausibly higher than that at z < 2 (Fig- 
ure [iTl) and hence plausibly evolves from z ~ 4-7 to 
z 2. The observed evolution is therefore in a sim- 
ilar sense to what is expected in many theoretical 
models (e.g., Bouche et al. 2010; Dave 2010; But- 
ton et al. 2010; Weinmann et al. 2011; Krumholz 
& Dekel 2011). 
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The star-forming (or mass-metallicity) sequence identi- 
fied here from the well-defined, redshift-independent /3 
vs. luminosity relation can provide us with powerful con- 
straints on the build-up and evolution of galaxies at early 
times. The modest scatter and uniform slope for this se- 
quence suggest a scenario in which galaxies build up and 
evolve in a uniform manner for most of early cosmic time. 
Luminous galaxies on this sequence are redder than lower 
luminosity galaxies, due to their larger dust extinction. 
Lower redshift galaxies are also redder, at all luminosities 
again most likely due to their higher dust content. 

Given the limited size and S/N of z ^ 7 samples, the 
t/y-continuum slope distribution at z ^ 7 is more poorly 
defined than at z < 6 and would benefit from even deeper 
observations than are currently available. Such observa- 
tions would be valuable not only for reducing the size 
of current photometric errors, but also for extending our 
samples to fainter magnitudes at high S/N. 

In addition, we look forward to more WFC3 /IR obser- 
vations over the '^800 arcmin^ CANDELS fields (Grogin 
et al. 2011; Koekemoer et al. 2011). These wide-area ob- 
servations will allow us to accurately establish the UV- 
continuum slope distribution to high luminosities where 
the rarity of sources has made such determinations diffi- 



cult in the past. 
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APPENDIX 

A. COMPARISONS WITH C/y-CONTINUUM SLOPE MEASUREMENTS BASED ON A SINGLE UV COLOR 

In the present study, we determine J/y-continuum slopes /? from a fit to all available flux observations in the rest- 
frame UV . Only passbands sufficiently redward of the Lyman Break are included to avoid contamination from Lya 
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Fig. 21. — Relative volumes available for selecting z ~ 4, 2 ~ 5, 2 ~ 6, and z ~ 7 galaxies versus (/V-continuum slope /3, given our 
Lyman-Break selection criteria {Figure|2]l. This selection volume is calculated for the HUDF09 fields over the magnitude ranges 25.0-27.0 
mag {left panel) and 27.0-28.5 mag {right panel). While our selections are more efficient at selecting galaxies with bluer (7V'-continuum 
slopes /3, these selections are also effective in identifying galaxies to quite red C/V-continuum slopes /3. 

emission or IGM absorption (below ^^1216 A). The goal is to use all available flux information to constrain the UV- 
continuum slope and to adopt the most extended possible wavelength baseline. This allows us to minimize the errors 
on our t/y-continuum slope measurements. 

One potential difficulty with this approach is that the wavelength baseline used to estimate the ?71^-continuum 
slope varies somewhat depending on the sample. The wavelength baseline adopted for z ~ 4 galaxies, for example, is 
different from the wavelength baseline used for z '--^ 6 galaxies (e.g., see Figure S]) and similarly for galaxies at z ^ 5 
and z ^ 7 (see also Table [3]). Because of these different wavelength baselines, we could potentially measure different 
C/y-continuum slopes for galaxies at different redshifts even if there is no intrinsic evolution in the underlying galaxy 
populations themselves. 

To determine the extent to which our varying wavelength baseline biases our C/y-continuum slope measurements, 
we also estimated C/F-continuum slopes /3 for sources in our samples using the wavelength baseline I6OOA to 2200A . 
All galaxies in our z ~ 4-7 samples have coverage at these wavelengths and so the J/y-continuum slope measurements 
can be made in a consistent way across all four samples. Because of the very limited wavelength ranges involved here, 
we derive the C/y-continuum slopes from 2-3 rest-frame UV bands. We have derived the following formula to convert 
the observed colors to the equivalent [/^-continuum slope /3. The formula are as follows: 
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These formulae are similar to those already presented in Bouwens et al. (2010a), Dunlop et al. (2012), and Wilkins et 
al (2011). 

To quantify possible systematics, we compared the [/V^-continuum slopes /3 derived using our fiducial approach 
(using the full flux information from the f/V^-continuum) and that estimated from our Lyman-Break samples using 
the above formula. We found that the median C/y-continuum slope /3 using our fiducial approach is offset by just 
0.03, -0.15, 0.06, and 0.0 relative to that determined by Egs. [AIIIMI for our z ~ 4, z ~ 5, z 6, and z ~ 7 samples, 
respectively. These offsets are comparable in size than the A/3 ~ 0.10-0.28 uncertainties we estimate to be present in 
the derived f/y-continuum slopes (3 based upon potential systematics in the photometry. We would therefore expect 
no large biases in our UV -continuum slope measurements as a result of our use of a variable wavelength baseline. Our 
measurements of the J/y-continuum slope /3 should thus be both reliable and have smaller uncertainties. 

B. CORRECTIONS TO THE (/V-CONTINUUM SLOPE DfSTRIBUTION 

Our measurements of the J/y-continuum slopes /3 are, of course, subject to a variety of selection and measurement 
biases (e.g., Bouwens et al. 2009; Dunlop et al. 2012). These biases include a preferential selection for sources with very 
blue [/^-continuum slopes /3 and the effect of noise in artificially increasing the scatter in the measured C/y-continuum 
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Fig. 22. — Biases in the mean t/V-continuum slope /3 distribution expected to arise from /9-dependent selection effects {dashed lines: 
Appendix B.1.1) and from any coupling of the photometric errors in the measurement process to source selection {dotted lines: Appendix 
B.1.2). The panels present the mean (/V-continuum slope /3 we would expect to find from our observed Lyman-I3reak samples given an 
input (7y-continuum slope /3 distribution with a mean /3 of —2. The results are presented as a function of the rest-frame UV magnitudes 
of the sources. The biases expected using the "robust" photometric redshift selection of Dunlop et al. (2012; Dll) are also included {red 
circles: from their figure 8). It is remarkable how poorly the photometric redshift techniques of Dunlop et al. (2012) perform relative to 
the present approach. See Figure [TOl §4.6, Appendix B.1.2, and Appendix D for a brief explanation for the differences between the present 
approach and that employed by Dunlop et al. (2012). In general, we expect extremely small biases in the mean {/V-continuum slopes /3 
derived from our samples, i.e., A/3 < 0.1, relative to the actual values. Of course, at very faint magnitudes, the biases are a little larger, 
but they are still much smaller than the biases suffered by photometric redshift techniques where source selection and /3 measurements are 
tightly coupled. 

slopes. Since these effects can have a significant effect on the derived /3 distribution, we cannot draw scientifically 
useful conclusions unless we determine the magnitude of these biases on the C/y-continuum slope /3 distribution and 
apply a correction. 

In this appendix, we quantify the approximate effect that source selection and photometric error coupling bias have 
on the C/y-continuum slope distribution (Appendix B.f ) and then derive a correction for these biases (Appendix B.2). 
In Appendix B.3, we quantify the effect of photometric errors in increasing the overall scatter in the [/^-continuum 
slope distribution. Finally, in Appendix B.4, we compare the UV-contimiwoa slopes f5 distribution we derive from 
ultra-deep and wide-area data sets to ensure that our results show a basic self consistency. This is important for 
verifying that all the relevant biases are understood and correctly treated. 



B.l. Potential Biases in the Mean /3 

We would expect small biases in the mean f/y-continuum slope /? we measure from our observed samples. These 
biases arise whenever galaxies with specific properties - intrinsic or observed - are easier to select than others. We 
discuss two such biases: (1) one based on the fact that galaxies with certain intrinsic colors are easier to select than 
others (Appendix B.1.1) and (2) one arising from the fact that the information used to measure /? is not always 
independent from that used to select the sources (Appendix B.1.2). 

The bias described in Appendix B.1.1 arises because not all intrinsic values for /3 are selected with the same efficiency, 
and the bias described in Appendix B.1.2 arises because the average measured value for /3 is not always equal to the 
intrinsic value. 

The purpose of Appendix B.l is to provide a description of the relevant biases and their approximate size for the most 
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Fig. 23. — Distribution of {/^-continuum slopes /3 for faint (27.0-28.5 mag) galaxies in the three HUDF09 fields {blue histogram). For 
compa riso n, the selection volume available to identify galaxies versus {/V-continuum slope is also included as the black lines (see also 
Figure I2TI 1. The selection volumes presented here are normalized to match the number of galaxies found near the peak of distribution. 
This figure is similar to Figure 5 of Bouwens et al. (2009). A clear deficit of star- forming galaxies with very red C/y-continuum slopes is 
observed. This figure demonstrates that this deficit is almost certainly real and is not simply the result of selection biases. 




25 26 27 28 29 
m,R (AB) 

Fig. 24. — Scatter in the observed (yV-continuum slope /3 distribution expected to originate from photometric errors ("noise"). The 
plotted results are for (7V-continuum slope /3 determinations over the HUDF. The scatter is determined by adding artificial sources to the 
observations and then recovering these sources and measuring their (/V-continuum slopes. Differences between the input f/V-continuum 
slopes (5 and that recovered from the observations are used to quantify the approximate scatter in the t/V-continuum slope distribution 
introduced by noise. The scatter we find in /3 is much lower for lower redshift samples than for our highest redshift sample. We have 
the longer wavelength baseline and larger number of broadband filters available to establish the {/V-continuum slope /3 at lower redshift. 
The scatter in our l/V-continuum slope /3 measurements for our z ~ 4-5 samples is ~1.5x smaller using the full flux information in the 
rest-frame UV SED (i.e., i775Zg5oyi05>^l25 for our 2 ~ 4 sample and 2;850^105''^125^^160 for our 2 ~ 5 sample) than using information in a 
single UV color. 
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Fig. 25. — C/y-continuum slope /3 versus UV luminosity for our z ~ 4, z ~ 5, 2 ~ 6, and 2 ~ 7 samples. The red points are sources 
from our HUDF09 samples and the black points are from our ERS+CANDELS samples. The large solid squares show the biweight mean 
C/V-continuum slope measured in 1.0 mag intervals. The error bars give the Itr uncertainties on the biweight mean. The red and black 
squares are horizontally offset for clarity (to avoid direct overlap with each other). The blue lines show the best-fit relationship between 
(JV-continuum slope /3 and luminosity presented (§3.6). Excellent agreement is observed between the mean /3 determinations in our ultra- 
deep HUDF09 and those in our wide-area ERS+CANDELS selections in the luminosity range where the determinations overlap. This 
suggests that the /3 distribution we derive from our shallower data sets is fair and any possible residual biases (not accounted for in our 
calculated corrections: Appendix B.l- Appendix B.3) are small. 



simplistic cases (i.e., if the /3 distribution is a delta function). In Appendix B.2, we calculate these biases assuming a 
more realistic input distribution for (3. 

B.1.1 "Selection Volume Bias": Biases Related to the Intrinsic Selectability of Sources 

In general, we would expect our high-redshift selections to be more effective in identifying galaxies with certain 
[/T^-continuum slopes than others - resulting in small biases in the mean t/l^-continuum slope /3 derived from our 
samples. Since bluer galaxies are almost always easier to select than redder galaxies in Lyman-break galaxy selections, 
this bias would be towards bluer colors. This effect is often referred to as "template bias" in the literature. 

To quantify the extent to which object selection affects the distribution of t/y-continuum slopes recovered from the 
observations, we run extensive Monte-Carlo simulations. In these Monte-Carlo simulations, we insert artificial galaxies 
into the real data with a wide variety of luminosities and colors and then reselect these galaxies using the same method 
as we used on the real sources. We then use these simulations to determine how the selection efficiency and hence 
selection volume depends upon the intrinsic f/V^- continuum slope /3. A detailed description of these simulations is 
given in Appendix C. 

Figure [m provides an illustration of the basic results from these simulations in the magnitude ranges 25.0 - 27.0 
mag and 27.0 - 28.5 mag. Indeed we find the expected trends in the simulations. The selection efficiency is largest 
for galaxies with the bluest J7X^-continuum slopes. As in other Lyman-break selections, the selection volumes show a 
sizeable decrease towards redder f/F-continuum slopes, especially for /3 > 0.0. 

The effect of these biases on the mean C/T^-continuum slope (3 derived from the observations however are relatively 
modest for typical samples. For example, assuming the intrinsic distribution of [/1^-continuum slopes /3 has a mean /3 
of —1.5 and Icr scatter of 0.3 (see TableS]), the bias in the mean /3 is just A/3 ~ 0.1 for a ■~28 mag galaxy in our z ^ A 
selection. We estimate the size of this bias by starting with an initial distribution of /?'s, weighting this distribution 
by the effective volume at each /3 (for a given magnitude), and then computing the shift in the biweight mean /3. 



36 



Figure [H] shows the effect of these selection biases on the mean [/^-continuum slope /3 found in our z ^ 4-7 
selections. The biases shown in Figure [22] assume that the intrinsic /? distribution has a mean value of —2.0 and Icr 
scatter of 0.3. While we note a bias towards bluer [/T^-continuum slopes /3, these biases are not extraordinarily large. 
Similarly small selection biases are found in our other data sets, i.e., the two other HUDF09 fields, the ERS field, and 
the CDF-South CANDELS field. 

Comparing the number of sources observed with a given /3 with the computed selection volumes for this /3 gives us 
another means of assessing the importance of this bias. As can be seen in Figure [52] for our HUDF selections, we 
can select sources to much redder /3's than generally observed in our selections. As in our direct estimates of the bias 
(Figure [22)) . this suggests that the selection effects are only having a modest effect on our derived /3's. 

We emphasize that the biases presented in this subsection and the next are simply intended to be illustrative; more 
realistic estimates of the bias (and those values of the bias we will use to correct the observations) are presented in 
Appendix B.2. 

B.1.2. Photometric Error Coupling Bias 

The intrinsic selectability of sources is not the only cause for biases in the mean [/V^-continuum slope measurements. 
For many samples, biases can arise as a result of the fact that source selection is done on the basis of the same 
photometric information as used for the [/1^-continuum slope /3 measurements. This can be mildly problematic, since 
the C/V^-continuum slope /3 measurements are then coupled to the selectability of a source. If errors in the photometry 
of a source make the source easier to select, any effect these same photometric errors have on the measured /3 results 
in biases. Dunlop et al. (2012) show how significant this effect can be using their own photometric redshift selection 
as a reference (Figure [T0|) . 

The size of this effect can be determined using the same simulations as described above. The measured values of 
/3 (after selection) are compared with the input values of /?. Differences between the mean ?7y-continuum slope f3 
measured for the selected galaxies and that input into the simulations is the bias. Throughout we will refer to this 
bias as the "photometric error coupling bias." 

In Figure [22] we show the mean t/y-continuum slopes (3 we recover for our z ~ 4-7 selections in the HUDF09 field 
assuming an input C/T^-continuum slope /3 of —2. Strikingly, the output distribution of f/F-continuum slopes /? we 
find is almost exactly the same as our input distribution, so it is immediately clear that biases resulting from noise are 
small. Nonetheless, we do note a small bias in the recovered [/V^-continuum slope distribution towards redder slopes. 

The explanation for both of these effects is simple. The bias is small because we use a completely different region 
of the UV SED (including the two bluest rest-frame UV bands) for selecting galaxies than we use to measure the 
[/T^-continuum slope (the second bluest UV band and redder). The effect of photometric scatter on the selection of 
sources is therefore largely independent of the effects of this scatter on C/T^-continuum slope /? estimates. 

The bias towards redder C/V^-continuum slopes occurs as a result of the small overlap between the bands used 
for selection and those used for measurement of the [/V^- continuum slopes. Since this overlapping band is the long 
wavelength anchor for the selection, any scatter towards fainter values makes the source bluer in the ?71^-continuum and 
therefore easier to select. However, since this overlapping band is also the short wavelength anchor for C/F-continuum 
slope (3 estimates, this same scatter towards fainter values results in redder values for the [/^-continuum slope /3. 

B.2. Corrections to the observed UV -continuum slope distribution 

In Appendix B.l, we estimated the approximate bias in the mean f/V^- continuum slope /3 we would determine from 
the observations. We began by estimating the bias that would result from the fact that galaxies with bluer UV- 
continuum slopes /? are easier to select than galaxies with redder C/F-continuum slopes (Appendix B.1.1). We then 
estimated the bias we would expect in the measured values for /3 as a result of the slight coupling that occurs between 
source selection and the measurement process (Appendix B.1.2). Of course, both biases affect the mean [/l^-continuum 
slope f3, so we need to add the above biases together to obtain the total bias in the t/F-continuum slope /3. The results 
are presented in Figure [22] (so/id lines). 

Remarkably enough, the approximate bias in the mean t/y-continuum slope /? is almost zero. The bias towards 
bluer slopes (from the intrinsic selection biases: B.1.1) largely offsets the bias towards redder slopes (from the slight 
coupling between source selection and /3 measurements: B.1.2), resulting in a very small bias overall. Given the very 
small size of the biases expected here for our faint samples, it is certainly striking how large these same biases are 
for the "robust" photometric redshift selection of Dunlop et al. (2012). Figures [lU] and [^ provide a rather dramatic 
illustration of the differences. The contrast in biases is especially noteworthy, especially given the large number of 
candidates in our faint samples. 

While we can calculate the approximate effect of the aforementioned biases on /3 given some input distribution, for 
the simulations to be accurate we must use the true underlying /? distribution for these inputs. Since we are not able 
to establish the underlying /3 distribution without first establishing the biases (so we can apply a correction to the 
observed distribution of /3's), it was necessary to follow an iterative approach. The observed /3 served as a starting 
point for the computation of the biases. In each iteration, we used the previous best estimate of the ?71^-continuum 
slope /3 distribution to establish the relevant corrections until we obtained convergence. 

Biases in the mean t/l^-continuum slope were estimated in a very similar way for each of our samples (z ^ 4, z ^ 5, 
z ^ 6, and z ~ 7). 



37 



B.3. Biases in the la Scatter Measured for the /3 Distribution 

Flux measurement errors act to significantly broaden the C/T^-continuum slope f5 distribution. The added scatter 
can be substantial, particularly near the selection limits for our samples. Typical uncertainties on individual UV- 
continuum slope /3 measurements range from 0.4 to 1.0, with the largest uncertainties being relevant for z 7 galaxies 
where only two passbands ( J125, ^^leo) are available to make the measurement and the wavelength baseline is relatively 
short. An illustration of the extent to which these errors can increase the apparent scatter in the /3 distribution can 
be found in Bouwens et al. (2009: Figure 4). 

To quantify the extent to which flux measurement error ( "noise" ) increases the spread in the J/V-continuum slope (3 
distribution, we again rely on the simulations described in Appendix B.l (see also Appendix C). For these simulations, 
we assume that all galaxies have exactly the same J/y-continuum slope (3, with no scatter, and then measure the 
scatter in the recovered distribution. The derived la scatter is shown in Figure [M] for z ~ 4-7 galaxies identified in 
the HUDF observations. Scatter is largest for faint sources in each of our z ^ 4-7 samples and also in our z ^ 6-7 
samples. Scatter in our lower redshift samples is smaller because of the much larger number of passbands and longer 
wavelength baselines over which we use to measure the [/^-continuum slopes f3 (Tableland §3.3). 

Another benefit of having run these simulations is that they allow us to quantify the extent to which we can reduce 
the overall uncertainties in /3 using the full flux information in the UV continuum, rather than using just a single color 
(§3.3 and Appendix A). These gains are most dramatic for z ^ 4 and z ^ 5 galaxies in our flelds where we use the 
information in four passbands (Table |3]) rather than just two (Appendix A). In the z ^ 4 or z ^ 5 cases, for example, 
we find a factor of '^1.5 reduction in the scatter. 

B.4- Comparisons between the HUDF09 and ERS+CANDELS Determinations 

In the previous sections, we outlined the procedure we use to determine the distribution of C/y-continuum slopes (3 
for high-redshift galaxy samples, after correcting for various selection and measurement biases. 

An important check on these results is to compare the distributions we derive at different depths. How well does the 
distribution of C/T^-continuum slopes /3 obtained from the ultra-deep HUDF09 data set agree with that obtained from 
the wide-area ERS-I-CANDELS observations? The results are shown in Figure [51] for all four of our Lyman-Break 
selections. While the agreement is excellent overall, this should perhaps not come as a great surprise - given the very 
small corrections that need to be made to each of our selections (Figure [22)) . 

C. SIMULATION PROCEDURE USED TO ASSESS BIASES 

The [/l/-continuum slope /3 distributions we find are sensitive to a wide variety of object selection and measurement 
issues. To determine the approximate size of these effects, we ran an extensive set of Monte-Carlo simulations where 
we inserted artificial galaxies into the real data, reselected them and measured their properties, all in the same way as 
done for our real samples (§3). Galaxies are randomly included in the simulated images over the full redshift window 
where we might conceivably select them, i.e., z ^ 2.9-5.0 for our z ~ 4 samples, z ^ 4.0-6.0 for our z ~ 5 samples, 
z ~ 5.0-7.0 for our z ~ 6 samples, and z ^ 6.0-8.0 for our z ~ 7 samples. For these simulations, we start with real 
pixel- by-pixel images of similar luminosity galaxies from the z ~ 4 HUDF sample of Bouwens et al. (2007). The z ~ 4 
HUDF sample of Bouwens et al. (2007) implicitly defines the size and morphology distribution used in our simulations 
for z ^ 4 galaxies, both in terms of the absolute sizes, the size-luminosity relation, and also the width of the size 
distribution. Sizes of the sources are scaled as (1 -I- z)~^ to match the observed size-redshift trends (e.g., Bouwens et 
al. 2004; Ferguson et al. 2004; Buitrago et al. 2008). Such a size scaling results in a close reproduction of the actual 
size distribution of galaxies in the z ~ 7-8 observations (Oesch et al. 2010b; Appendix A of Bouwens et al. 2011b). 
We have utilized such techniques extensively in the last decade to establish the reliability of our selections, to quantify 
biases in measured magnitudes and colors, and to establish the needed corrections (e.g., Bouwens et al. 2003, 2004, 
2006, 2007, 2008, 2009, 2011b). 

D. BIASES IN MEASURING /3 USING A PHOTOMETRIC REDSHIFT PROCEDURE? 

While we offer one approach to establishing the t/V-continuum slope /3 distribution, a complementary approach was 
put forward by Dunlop et al. (2012). In this approach, high-redshift sources are selected using a photometric redshift 
procedure and their ?71^- continuum slopes /? are estimated directly from their observed colors. Using this methodology, 
Dunlop et al. (2012) found no dependence of /3 on luminosity or redshift - which is in contrast to the present results 
and most results in the literature (§4.3-§4.4). 

One of the most interesting claims Dunlop et al. (2012) made using their approach was that measurements of the 
C/T^-continuum slope /3 at faint magnitudes were likely unreliable, and hence it was not possible to determine the 
mean f/F-continuum slope /3 to very faint magnitudes. Dunlop et al. (2012) demonstrated these significant biases by 
inserting high-redshift sources into their catalogs with specific C/y-continuum slopes /3, adding noise to the photometry, 
and then deriving the mean C/T^- continuum slope (3 for the selected sources. Dunlop et al. (2012) found that galaxies 
at fainter magnitude levels, particularly >28 mag, were biased to much bluer [/V^- continuum slopes (3 than the sources 
they put into their simulations. These biases resulted from the effect of noise in perturbing the colors of sources in their 
catalogs and these same colors being used for both source selection and (3 measurements. This is the same photometric 
error coupling bias we discussed in Appendix B.l. 2. 

Are such biases really as unavoidable as Dunlop et al. (2012) suggest? To investigate this, we wrote software to 
approximately replicate the procedure by Dunlop et al. (2012). We then ran Monte-Carlo simulations involving more 
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Fig. 26. — Mean f/V-continuum slope /3 we recover versus IR magnitude at 2 ~ 7 using a photometric redshift selection procedure and 
input C/V-continuum slopes /3 of —2 (see Appendix D). The procedure we use to select galaxies and measure their C/V-continuum slopes 
/3 here is designed to be similar to that used by Dunlop et al. (2012). Shown are the results selecting sources using the 5- band BVizY 
photometry (blue), 6-band BVizYJ photometry (green), and the full 7-band BVizY J H photometry (red). Our results for the full 7-band 
photometry seem very similar to that obtained by Dunlop et al. (2012: solid red circles). The mean (7\/-continuum slope /3 we recover 
using five or six-band photometry is much less biased than what we recover using the full seven-band photometry. The reason a 7-band 
selection results in biased measures of the f/V-continuum slope /3 is that similar information (i.e., the J and //-band fluxes) is used (1) to 
measure their C/y-continuum slopes /3 and (2) to select the sources. Because of this coupling, the 7-band photometric redshift selection 
excludes as unreliable precisely those sources which are reddest in their observed UV slopes — biasing the slope results. This is the same as 
the "photometric error coupling bias" discussed in Appendix B.1.2, and it becomes very large at low S/N. Signiflcantly less biased measures 
of the mean f/y-continuum slope /3 can be obtained using flve or six-band photometric redshift selections (due to the independence of the 
information being used for source selection and the measurement of /3). These simulations effectively show that f/V-continuum slopes /3 
can be recovered accurately to very faint flux levels, given the appropriate technique. 



than 6 x 10'' sources. Each source was assigned a random i?i6o-band magnitude between 25 AB mag and 30 AB mag, 
keeping the intrinsic /? fixed for ah sources in a simulation run. Fluxes for individual sources were computed based 
on the input /3's assuming power-law SEDs. Noise was added to the fluxes of sources in the simulations assuming 
HUDF09-depth data. Finally, sources were selected using a similar photometric redshift procedure to that employed 
in Dunlop et al. (2012) and McLure et al. (2011). This involved comparing the measured fluxes for each source with 
the fluxes from various template SEDs to determine the minimum at each redshift and then calculating the redshift 
likelihood function P{z). 

For SED templates, we used the four templates from Coleman et al. (1980) and the two bluest starburst templates 
from Kinney et al. (1996), linearly interpolating between adjacent templates. While this provides us with slightly less 
freedom in fitting arbitrary SEDs than is available to Dunlop et al. (2012) - who utilising the Bruzual & Chariot 
(2003) spectral synthesis code consider a wide range of ages, metallicities, and dust extinctions - the SED template 
set we consider covers a sufficient range to illustrate the relevant biases in /3. No prior was assumed. Sources were 
selected if (1) the primary redshift solution is preferred at >95% confidence over the secondary redshift solution (i.e., 
X^(best-fit, high-z) < x^(best-fit, low-z) -f 4, (2) >50% of the integrated redshift likelihood / P{z) is at z > 6, and (3) 
the best-fit is statistically acceptable (typically < 10 given the number of flux constraints and free parameters 
in the SED modeling). Following Dunlop et al. (2012), we estimate the C/y-continuum slope /3 for sources in the 
simulations from the observed J125 — ^^leo colors, using the expression /3 = 4.43( J125 — i?i6o) — 2.00. 

The results of our simulations are shown in Figure [H] for initial f3 values of —2.0 and —2.5. Not surprisingly, we 
obtain similar results with our simulations {red line) as what Dunlop et al. (2012) obtained (solid red circles). The 
simulations indicate a strong bias in the recovered J/y-continuum slopes (/?) at faint magnitudes towards bluer slopes. 
This is particularly true at > 28 mag where the recovered /3 is 5/3 ~ 0.8 bluer than the intrinsic value. As we show 
below, these biases depend critically on the passbands used to select sources. This bias likely originates from a close 
coupling of the information used to select sources and that used to measure /? (Appendix B.1.2). This coupling is 
problematic because any effect that noise has in making the apparent UV slopes bluer would also cause objects to be 
more amenable to selection via the Dunlop et al. (2012) procedure. 

Is it possible for us to recover the C/y-continuum slope /3 in a less biased fashion? The answer is yes, but it requires 
that we use independent information to select the sources from what we use to measure the [/V^-continuum slopes - 
thereby decoupling the effect of noise on the two processes. We can do this by repeating the above photometric redshift 
selection, but only using the flux information in the five passbands that do not contribute to our t/T^-continuum slope 
estimates, i.e., the i?435, Vgoe, ^775, ^850, and F105 bands. We also consider a selection using the flve passbands that 
do not contribute plus the bluest remaining band, i.e., the i?435, Veoe, 1775, zsso: ^105 : and J125 bands. The results 
are shown in Figure 1261 and it is clear that the mean C/y-continuum slope /? we measure is less biased, especially in 
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TABLE 8 

A COMPLETE LIST OF THE UV-CONTINUUM SLOPES /3 WE MEASURE FOR 
SOURCES IN OUR 2: ~ 4, 2: ~ 5, 2 6, AND Z ^1 SAMPLES FROM THE 

HUDF09+ERS+CANDELS data sets'' 



R.A. 


Declination 




/3 


<z> 


Data Sct'^ 


03:32:38.49 


-27:48:21.41 


-17.82 


-1.92±0.24 


4 


1 


03:32:38.42 


-27:48:18.68 


-16.46 


-1.61±0.57 


4 


1 


03:32:37.67 


-27:48:16.87 


-17.49 


-2.19±0.37 


4 


1 


03:32:37.31 


-27:48:13.74 


-18.41 


-2.42±0.19 


4 


1 


03:32:38.14 


-27:48:12.75 


-17.82 


-1.35±0.34 


4 


1 



^ Table [8] is published in its entirety in the electronic edition of the Astro- 
physical Journal. A portion is shown here for guidance regarding its form 
and content. 

^ The data set from which the source was selected and in which its UV- 
continuum slope beta derived (1 = HUDF09, 2 = HUDF09-1, 3 = HUDF09- 
2, 4 = CANDELS/ERS) 

the five passband case. Indeed, the mean C/F-continuum slope /3 can be successfuhy recovered to ~29 AB mag, even 
in the six passband case. This provides a clear demonstration that it is possible to measure the mean UV -continuum 
slope P to very faint flux levels, given the appropriate technique. 

We minimize the biases by excluding those passbands used for the j3 measurements from the selection process (i.e. 
excluding the reddest one or two passbands from the photometric redshift determinations) . Are there any disadvantages 
to this approach? While one might argue that this results in some loss of ability to control for contamination, such 
losses are small - as one can see from the following considerations: (1) The only secure characteristic we know about 
high redshift star-forming galaxies is that they disappear at wavelengths bluer than ~1216A and are detected blueward 
of that in the rest-frame UV . Use of five or six-band photometry to compute photometric redshifts allows us to verify 
that sources have this characteristic. (2) Selection of high-redshift sources can be improved by excluding those sources 
with the reddest colors redward of the break. Only one color is necessary for this task. For a z ~ 7 selection, this means 
a 6-band BVizYJ photometric redshift selection (the same bands are used for our z ~ 7 Lyman-Break selection). 
Adding a 7th band to the photometric redshift selection might appear to help, but actually does not, as this results 
in significantly biased UV slope (3 measurements (see Figure I26p . (4) Even if a small number of contaminants make 
it into one's samples one can mitigate the effect they have on the mean ?7T^-continuum slope (3 through the use of 
robust statistics like the biweight mean (see §3.5). Thus there appears to be few disadvantages to using an approach 
that minimizes the biases. The gains in measurement reliability greatly offset any potential concerns. 

E. r/V-CONTINUUM SLOPE /3 MEASUREMENTS FOR INDIVIDUAL SOURCES 

In the interests of transparency and to facilitate comparisons with other high-redshift studies, we include a complete 
list of the [/1^-continuum slopes j3 that we derive for all 2518 individual sources in our z 4, z ~ 5, z ~ 6, and z 1 
samples in Table [S] 

We note that a small fraction (<1%) of the sources in our samples have very red UV-continuum slopes /?, i.e., > 0.5, 
and therefore may not be at high redshifts (since this would imply that the apparent break at ~ 1 micron in their SEDs 
may not be due to absorption by neutral hydrogen, but due to their overall spectral shape). We should emphasize, 
however, that these sources have essentially no effect, i.e., df3 < 0.01, on the biweight-mean /3's we report. 



